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Chairpersons’ message 


On behalf of the Melecon 2000 organizing committee, we are pleased to welcome you 
to the 10™ Mediterranean Electrotechnical Conference on Information Technology 
and Electrotechnology for the Mediterranean Countries. An exciting and stimulating 
technical program was made possible, with the presentation of a total of 292 papers 
from scientists coming from 36 countries, covering the most recent technological 
advances on information technology and electrotechnology. 


Throughout the world, electrotechnology and information technology form the basis 
of an entirely new era in professional practice and the delivery of services. It is 
characterized as a revolution based on information, hoping to improve the quality of 
life. This is especially important in the Mediterranean region, which bridges countries 
from three continents, Europe, Asia and Africa. Thus, the special theme of the 
conference has been selected to be: /nformation Technology and Electrotechnology 
for the Mediterranean Countries. The papers of Melecon 2000 will be presented 
under the following sessions: 


Regional Communication and Information Technology (106 papers) 

High Performance Networking, Control & Optimization, Communications/Network 
Design, Intelligent Networks, Network Reliability & Quality of Service, Broadband 
Network Communications I, Il, Communication Systems, Optical Systems & 
Networks, Internet Services & Applications, Wireless & Mobile Communications I, 
II, III, Internet Services I, II, and Electron Devices. 


Signal and Image Processing (108 papers) 

Neural Networks I, II, Intelligent Systems, Image & Multidimensional Signal 
Processing I, II, Speech & Sound Processing, Filters, Signal & Image Processing I, II, 
Transportation Engineering, Signal Processing, Circuits & Systems I, Il, Digital 
Signal Processing I, II, and Communication Systems DSP. 


Power Generation Supply and Renewable Resources (78 papers) 
Distribution, Power Quality, Transmission I, II, II, Machines I, I, Generation, 
Environmental/Energy Conservation, Control I, II and Renewables I, II. 


Furthermore, the IEEE Region 8 Student Paper Contest will take place during 
Melecon 2000 where 6 finalists will compete. 


We wish to express our thanks to all the contributors who submitted papers to 
Melecon 2000 for review. It goes without saying that we could not have had such a 
high quality technical program without their contributions. In addition, we would like 
to thank B. Zajc and J. Baal-Schem for their support in making this conference a 
reality. 


Also, we would like to express our sincere thanks to C. Economides, G. Malliotis, S. 
Panis, and C. Stasopoulos, for their dedicated efforts during the abstract review 
process and the preparation of the technical program. Moreover, we would like to 
thank P. Symeou, Y. Frangoulides, 8. Spyrou, M. Papaconstantinou, A. Constantinou, 


iii 


K. Diamantides, S. Soteriou, N. Michaelides, and A. Kellas for their dedicated work 
and support in materializing this conference. 


Last but not least, we would like to thank E. Polycarpou and M. Avraam for their 
exceptional technical assistance in managing the paper handling and the electronic 
communications of the conference, as well as in preparing the advance program, and 
the conference proceedings. Moreover, we would like to express our sincere thanks to 
K. Mavromati, C. Hadjiyianni, S. Evstathiou, and Y. Mylonas for their additional 
technical assistance 1n organising this conference. 


We hope that Melecon 2000 will offer opportunities for professional growth and 
establishing new contacts with colleagues. Our intention is to do all we can to make 
your participation in the conference worthwhile and your stay in Cyprus enjoyable. 


C. S. Pattichis 
C. N. Schizas 
L. S. Savvides 


May 2000 


Principal Sponsor’s Message 


My partners and I would like to welcome you to this 10” Mediterranean 
Electrotechnical Conference, Melecon 2000 on Information Technology and 
Electrotechnology for the Mediterranean countries. It is evident from this highly 
regarded technical programme that most of you will be presenting papers on 
Information Technology and Electrotechnology that are on the edge of what is 
currently possible. 


Throughout the history of industrial development, our standards of living and the way 
we conduct business have been reshaped by significant breakthroughs in technology, 
just like the ones you will be presenting. A vivid example of this high technology 
application is e-business or e-commerce which has been driven by leading edge 
technology achievements in information technology. 


From day one, PricewaterhouseCoopers have identified the critical role that this 
leading edge technology can play in the business of tomorrow. The most fundamental 
dynamics of the market place that is customers and markets, competitors and 
alliances, policy and regulation, emerging technologies and businesses are in constant 
evolution and e-business is at the heart of these changes. 


We at PricewaterhouseCoopers have proven that high technology can effectively be 
harnessed, in this case in the form of e-business, and translated into opportunities for 
our business partners to outpace the competition and succeed in today's new economy. 


We wish that you have a successful Melecon 2000 conference and hope that it will be 
an opportunity for professional advancement. 


Yours faithfully 
Meda) yore? 


Michael H Zampelas 
Chairman and Managing Partner 


PRICEWATERHOUSE(COPERS 


Julia House, 3 Themistocles Dervis Street, 
CY-1066 Nicosia, 

P.O.Box 21612, 

CY-1591 Nicosia, Cyprus 

Telephone: + 357 -2- 555000, 

Facsimile: + 357 -2- 555001, 

Internet: www.pwcglobal.com 
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Markov Process Model Cache Algorithm, 
Dedicated to Network File Servers 


Stefan Stancescu IEEE Member, Sorin Adam 


An algorithm is presented, dedicated to an WEB server 
intended to substantially improve the page delivery 
service based on the statistically collected data regarding 
frequency of hyperlink demand. The WEB page delivery 
service is modelled as a Markoy process where the most 
valuable demands are automatically ordered in a three- 
dimensional page space based on a second order Markov 
process model. 


Index terms—Markov process, network server, 
cache algorithm 


I. THE CONCEPT OF MARKOV PROCESS MODEL 
APLIED TO NETWORK FILE SERVER DEDICATED TO WEB 
SERVICES 


The HTML (Hyper Text Mark-up Language) is 
designed to build hypertext form of data 
representation for a global distributed knowledge 
database related to a complex subject by assembling 
with links elementary information data (web pages). A 
WEB server is a process that administers a hypertext 
database in view to send a hypertext page upon 
request as soon as possible. A WEB browser is a 
client process accessing a WEB server co-ordinated 
database, searching for selected data dictated by the 
each browser scope. Usually, each WEB server own 
governed database services a large number of WEB 
browser clients [1]. 

Each WEB server uses mechanisms to 
optimise the service for a large number of clients 
searching in a set of HTML pages, with 
simultaneously demands associated with each 
appropriate scope. Most of these mechanisms are 
borrowed from classical file servers in view to 
improve response time [2]. 

The WEB server activity may be modelled as 
a Markov process by attaching to each hyperlink in his 
own hypertext, the elementary conditional probability 
to be selected by one of WEB browser clients, if the 
browser arrived in precedent hyperlink. The Markov 
chain may be build by concatenating hyperlinks. The 
first order Markov process includes only the data 
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relative to one step forward event. The successive 
order Markov processes take in view information 
related to successive events. 

A way to obtain a reduced access time to web 
pages is by using data caches. Auxiliary data caches 
lower the network traffic by providing local storage of 
frequent accessed pages [3]. 

The caches reside in the web browsers and/or 
in web servers. In this paper we are interested only on 
data cache mechanism applied in web servers 
implemented in network file servers. 

The data caches implemented in web server 
processes the statistics of data access in view to build 
an optimal strategy to co-ordinate web page supply to 
web browsers. A server model coupled with data 
statistics are the support for the decision to be taken in 
each action that foresees the following demands. 


First order Markov process cache model 


We have analysed the Markov process model used in 
[4] that optimises the processing time in 
microprocessor data caches. This model is restricted to 
first order by the difficulties to implement in hardware 
the support for complex processed decisions. 

We can use a similar model to a cache 
system with more relaxed time restrictions as those 
permitted in the demande-response activity between 
WEB browsers and servers. In such a case the 
processing time and the memory page space offers the 
possibility to implement more complex software 
algorithms (and extend to higher degrees 
Markov process models). 


II. MARKOV MODELS OF WEB SERVER ACCESSING 
PROCESS 


General Markov model of WEB server 


A WEB sever processes a database of HTML linked 
pages. The hyperlinks in one page may direct the web 
browsers arrived there to a set of another linked pages 
(first order linked with the current one). Each such 
linked page will be labelled with the “first order” 
label, F1. The process is iterated in each page 
accessed by a F1 labelled page. In each HTML page 
resides hyperlinks to successive HTML pages. Each 
hyperlink founded in F1 labelled pages are labelled at 
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their turn by the “second order” labels F2. In such of 
way we obtain also the successive labelled pages 
“third order” F3 and so on. Is evident that the number 
of links grows exponentially at each level. 

Each labelled page is associated with a data 
structure storing statistical data, updated at each 
browser access. These mechanisms may be similar 
with those used by current operating systems in 
managing pages in virtual memories. An aging 
mechanism corrects the updating process of 
information [3], in view to discard old accessed pages. 
Finally, this data structure will keep the web server 
informed about current frequency of accesses of the 
clients to each page. In this way each Fn-labelled page 
will be associated with the nth elementary conditional 
probability to be selected by one of WEB browser 
clients arrived on the hyperlink origin that access this 
nth HTML page. 


First order Markov model for WEB servers 


According to [4], where is stored the first order 
information of processor cache miss hits, in the WEB 
server Case we memorise in appropriate data structures 
the frequencies of access normalised as conditional 
probabilities of transitions from one page to another. 
All this associated information is processed by the 
WEB server optimised access mechanism that 
implements a prediction algorithm for selection, 
conform the Bayes criterion. Selected pages will be 
uploaded before the effective page asks, estimating 
that these ones will be the most solicited. 

The WEB server optimised access 
mechanism may be implemented in software as will 
be described further. A dedicated server process will 
service each browser access, witch will associate a 
new shareable variable for each new WEB page 
solicited keeping the absolute address of the page and 
the frequency of the accesses. This dedicated server 
process will update the frequency access indicator if 
the variable is associated already by a precedent 
demand of another WEB browser. An overall control 
process manages all these shared variables, which 
sorts them on the frequencies of accesses, using 
conventional software. This mechanism includes well- 
known software sorting algorithms. 


Second order Markov model for WEB servers 


Based on assumption that the time restriction on WEB 
server case is more relaxed than the case of processor 
cache described in [4] we can improve the decision 
regarding uploading pages in advance by adding more 
information to the decision process. 

In consequence, we refine the Markov model 
to second order. The disadvantage of the first order 
model is that, once arrived in an HTML page, 
different browsers may be conducted by different 
scopes. The choosing of the following page may be 
decided by different reasons. In view to refine the 


decision process, we include in selecting criteria the 
precedent page of the browser route. 

For the variable attached to a labelled page, 
we associate, for each accessed hyperlink in this page, 
a subsequent set of dependent variables double 
labelled with the label of the “mother” variable and 
those of the pointed HTML page. The double-labelled 
page will store the frequency of the accesses of the 
last page trough the labelled page. This is accordingly 
to the conditional probability of the last page to be 
accessed only by “mother” labelled page. 

The decision mechanism based on the second 
order Markov process model consists in selecting 
preferred target page for a browser that has passed by 
a precedent and a current page, by selecting the 
maximum frequency access value stored in double- 
labelled variables generated by the current “mother” 
labelled page. 


III. THE THREE DIMENSIONAL ORDERED PAGE SPACE 
OF LINKS 


In view to retain essential information needed for 
accurate determination of the most valuable 
succession of two links, we can extract from the web 
server only the page names and the frequency of 
accessing it independently or conditionally. Each 
accessed page is labelled and is associated with a data 
structure in order to retain access information. A link 
is represented as a point in bi-dimmensional page 
space where axes mark the labelled pages. Each mark 
in the bi-dimmentional page space represents a 
hyperlink from one labelled page to the “mother” 
page. From the “mother” page we find, by subsequent 
hyperlinks, the double-labelled pages. These last 
pages may be represented on a third axis and the final 
point will be determinated by a three-dimmensional 
parameter. All the pages represented on the third axis 
will be ordered on the access frequency corresponding 
to the conditional probability of finding this page from 
the “mother” page. The ordering mechanism will be 
activated when the browser is residing in a “mother” 
page. The two-dimmensional initial page space is 
constructed by accessing new pages. The classical 
ordering algorithm is started for each point in the two- 
dimmensional initial page space. 

The information stored in a three- 
dimensional page space model is exploited by the 
WEB server which, for each browser arrived in two- 
dimensional page, page uploads the “child” page 
detected by the name deposed in the first cell of the 
third axis of the three-dimensional page space model. 


A. The two dimensional Markov process model 
example 


Let be a WEB server that manages six web pages with 
links (figure 1) and the associated transition matrix 
(figure 2). The ordering algorithm act only on the 
orizontal directions from left to right, obtaining the 
pages to be uploaded next to the let vertical axis. 
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Fig. 1: WEB pages with links 


Fig. 2: The transition matrix for WEB pages with 
links for two dimmensional Markov model 


As described previously, the access 
mechanism for the two-dimensional Markov model 
selects the page to be uploaded in the left side (figure 


Fig. 3: Ordered labelled pages after the sorting of the 
rows of the transition matrix for WEB pages for two- 
dimmensional Markov model 


B. The three dimensional Markov process model 
example 


eR Fig.4: The six planes of the three dimensional model 
For th WEB site with let the WEB 
La wet pert Ra ee oN og with the conditional probabilities for the second 
ordered Markov process model 


browser services maintain the current conditional 
probabilities matrix associated with each hyperlink as 
presented in figure 4. The face plane may be the same 
as the two-dimensional Markov matrix process model 
with the only condition for the sum on the horizontal 
rows to be one and the non-zero positions to be 
unchanged. 
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Fig.5: The six planes of the three dimensional model 
with the conditional probabilities after the sorting 
algorithm for the second ordered Markov process 

model 


In the figure 5, the planes with ordered WEB 
pages are presented. It is evident that for plane 
corresponding to plane A, the face plane is moved 
around the left vertical axis and only the nonzero 
positions may be modified, resulting the left side 
plane. All vertical planes corresponding to rest of 
pages are obtained by translating this first vertical 
plane with eventually modifying only positions with 
nonzero conditional probabilities. 

The ordering algorithm acts only on each 
face bottom axis, obtaining the pages to be uploaded 
next to the front page in the three-dimmensional 
Markov process model. 


IV. THE SIMULATION RESULTS 


For random generated WEB site structures combined 
with the random generated access demands the second 
Markov process model acted to determine the next 
page to be uploaded in advance. In all cases an 
upgrade in speed is obtained for the medium access 
time per page. The best access is obtained with 
heterogeneous distribution of conditional 
probabilities. 


V. CONCLUSIONS 


Based on first degree of Markov process design for 
processor caches, a second order Markov process 
model is used to implement software optimisation 
algorithm implemented to improve the response time 
of WEB servers to simultaneously WEB browser 
demands. The software implementation of the 
algorithm is possible by the relaxation of time 
constraints in processing WEB pages ina WEB 
server, compared with high-speed decision that need 
to be made in a processor cache. Considering higher 
degree Markov process models may refine the 
algorithm. This will be done in future works. 
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A DISTRIBUTED MEDIA SERVER 
MANAGEMENT SCHEME 


Sotirchos Stavros, Koziris Nectarios, Papakostantinou George 


Abstract In a distributed multimedia system, the 
distribution of digitized video, audio or other type of 
streamed data presents problems in both data 
communications and management of media servers. The 
system should be able to server users in a wide geographical 
area and to adapt itself to a varying number of users. 

In this paper, we propose an internet-based management 
schome for a media server system, assembled by a variable 
number of media servers geographically dispersed, where 
user requests are directed dynamically to an appropriate 
server according to their location and served in a unicast 
wiry. 

The scalability of the system is based on multicast type of 
negotiation between servers and shifting the management of 
content and load from centralized gateway to servers it selves. 


Index Terms—media servers, VoD 


I, INTRODUCTION 


In recent years the advances in storage capabilities, CPU 
performance, encoding techniques and communication 
technologies made it possible to store, process and transfer 
enormous amounts of information in digital representation. 
An increasing number of applications like video-on- 
demand (VoD), teleconferencing. telemedicine, 
tcleteaching, teleshopping, news-on-demand or digital 
libraries are starving for multimedia systems that provide 
all the services needed by them, 

Multimedia applications span a large spectrum of 
interaction, processing and communication dynamics. At 
the different parts of the spectrum different aspects of the 
system play the critical role in determining the service 
quality, In other words, a scenario that would be ideal for 
an authoring system could be extremely expensive for a 
video conferencing system and vice versa, 

The main design problems for the development of such 
systems is to ensure the just-in-time delivery of media 
elements in order to avoid interruptions or glitches in the 
flow of data, to maximize the quality of the service and to 
Minimize resource utilization at the client site. Therefore, 
such a media system has to cope with mixed-workloads 
containing Continuous data requests (e.g. for video and 
audio) and discrete data requests (e.g., for text or image 
data) from the users, The behavior of the media server 
visible to One user must not depend on the requests of other 


users. Parallelism is inherent to these systems due to the 
independence of the retrieval of different streams. 

The majority of the existing implementations are based on 
a single media server. There are also distributed solutions 
that use a centralized gateway server to direct the user fo 
pre-defined servers. In this paper, we propose a distributed 
media server scheme where users are directed dynamucally 
to the appropriate media server according to user location 
and network load. This method might be an improvement 
to current distribution methods. 

The rest of the paper is organized as follows: In section 3 
we review the underlying theory and the related work in 
the field of media servers. The overall architecture of our 
system is described in section 4 and the paper is concluded 
in section 5. 


Il. REVIEW AND RELATED WORK 


A. Applications 


Large-scale deployment of interactive multimedia services 
is a complex problem. The very nature of multimedia data 
combined with the inherent parallelism implied by the 
required transparency of the system to user requests are 
only two of the parameters that render this problem quite 
complex. 
Generally, multimedia applications can be broadly 
classified under two major categories — persistent and non- 
persistent where the word persist refers to the nature of the 
data at the source and/or destination. Video-on-Demand 
(VoD) is an example of the former and videoconferencing 
on the other hand is an example of the latter type. These 
applications require from the media system to support a 
number of services that can be categorized as follows 
according to the different types of content and services: 
e Support for linearly continuous multimedia streams, 
e.g. movies. 
This is the simplest way of retrieving multimedia 
information. The movie or sound sample is a 
continuous stream of information and the user can 
navigate forwards and backwards in the stream. 
e Support for non-continuous multimedia information, 
e.g. News. 
More complicated than the previous, this way of 
retrieving multimedia information gives the ability to 
the user to make more unpredictable control operations 
when retrieving the content. Information is often 
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sound. Morcover, this information is possibly stored m 
different databases. The client her is not only able to 
retrieve this content and make mapredictzble jaraps 
from database to database but he/she cam also add some 
new material into the database. 

Sapport for very larg: non-comtinuces multimedia 
hypertext content. ¢.¢_ Multimedia Ebraries. 

In this case multimedia clements are not only 
augmented in length bet is mumber too. Thus. those 
elements are probably diswibuted scogrephically = 


(VCR) quality the encoding speed of 22 MPEG1 
stream is 1 SMb/s —3Mb/s which mcans that 2 memety 
mimetics long movie which is digtized at SOOKB's 
requares about 1.6 GB of storage. The sewer MPEG-2 
coe chia « Apsoging gianna Se 


ncarty 3.5 GB. Tits smplc m 
al Ua ahs nad Nc: Meads of 
Sass 

Large mussber of simultamesas access 

For a real-time viewing of the comient. te storage 
systems munst be ablic to tramsfex the stream af the samc 
ial 2s i wes emooded to 2 mamber of posmibic wsexs 
Laser is the matin member Of sivcames teal 2 server 
cam Siomulezmconsty soerve. the saore is the summer of 


users that i cam serve. Depends om the mecdea sez 
im addetion & will also depend on dec type of the mcd 
service. Ths Ematatios cas pemibiy be overcame by 
duplscatme titles of by sme varios: cae 
opiiemzatsons. Tine member of wsexs remicvieg dhe same 
title cam be caanced cvest wp 30s by came 


data vary due to the comsent and appiication_E would be 
useful, if all media serwer archiscoumres would sappart ail 
different service types referred abowe. Nevertheless. m the 
peal world imteractive multimedia sysuems spem 2 gamer of 
types, depending om the degree of macractivity allowed In 
[0] imeeractive services are classified mio broadcast. pz}- 
per-view, quasi VoD_ sear VoD and wae VoD_ m acencine 


osder of imtcractivity zvaiiable_ mechmiques called banca [5] auc cae! aliccaricn 
[et 
BR Medea Servers © Reizabeiecy and Avaiiehiiey 


Lite ay other Kid of server. 2 media server mst Be 
muse Gino @ is @ sumane kanes. Ts acai 
means Geet the mca Gem berecen Gale (MIST) Er 
te complicte syssem decreases wien Ge sysic 
becomes more complies. Ome componct einch & of 
particular imapertames im 2 medka-sever & the sures 
sustom_ The MIB? for an average Gsk Gove & aot 
em the semiber of sass ues. Tete. mete 
sexwers Snonld be mpliememed m 2 wes thet smee- 


im [2] there are three major categories: ssorege prowadier. 
content provider aad network prowider. Mecse savers 
belong to the primary category thet of the storage provider 
A media sewer described m [5] comsists of 2 smraae device 
“geray_ 2 server undi and seweral user semoms. The scoraze 
dewice array uscally comsisis of an wray of Gui Groves A 
user station self commaims 2 Set Topas (STB) ¥ 
decodes by hardware the encoded dams delnwered by he 
Server unit. 2 terminal with 2 spezke= box. and 2 betr_sdThe 
_ serwer encodes the dase and collects the encoded Gaz of = 
“several pictures im blocks. A sequence of blocks of PORES OE ares are Ea wo acl 
continuous dara such 2s vdeo and sndio thm S Geiversd > cnoouraass Ge Gepiememaior of Gsrees 
@user station is called a sream_ Video and ano ardanccares and arorscoovery alecciiis and disk 
ouamaaons bbe RAW) 
| te sawe some storage capacity and transmission bandwidth. e « Abslity te kendle real-cime end poo reel-cime traffic 
Wile the peers fonction of 2 es Peace 
standardized encoding techmague for video sxcums. meds seve is seve muni: ealine Ge 
| A media has to handle very large dame waste as a sreams simmitencossiy_ & most also te adic 1 ae 
| Stawage space requirements. which can act be mumascd Dy 2 SeGSiaciory Service op mom reece Gace Such See 
Simgje processor. Whale 2 workstatim wpe of SS cae would be aacommicsed on he course of downicadime 
| gapport o few tems of sieaultamcoes muitimecis sweamns 2 new programs fom semiine: and somo sees 
f | maim or ome of the mam servers. This socossty cam be mare macibesat posonal asats (7) 
— ewidemt if we mention the important requirements of sact = © Fast geal processes ia ; 
ee ee ee aes 


Clearly, a multi-user multimedia server is a complex 
machine with stringent requirements. The satisfaction of 
these requirements is well beyond the realm of personal 
computers or individual workstations. A natural choice for 
a media server is a high performance computer consisting 
of multiple processors, connected by a high-speed 
interconnection network and optimized for fast I/O. 
Complementary, views have been expressed to this effect in 
the context of high performance relational database systems 


[8], [9]. 


C. Current Status 


Industry and academia are racing against time to construct 
the information superhighway. Companies like 
Oracle/nCUBE, Intel and IBM are trying to provide on- 
demand services by using distributed memory 
multicomputers as servers. Others like Silicon Graphics, 
Digital and Hewlett Packard are using multiprocessor 
technology to build media servers. At Berkeley University 
of California the Plateau multimedia research group is 
developing a distributed video-on-demand system that will 
be capable of storing thousands hours of video material. 
The Berkeley hierarchical video server uses tertiary storage 
to store thousands of hours of video material. The tertiary 
storage cannot support simultaneously display of content 
(referred as continuous media objects, CMOs), and the 
CMOs must be loaded from archive servers to video file 
servers from where CMOs can be viewed real-time. This 
implementation ignores the network load parameters in 
order to select the file server to load a requested video. 
Another implementation that comes from a company this 
time is IBM’s two-tier clustered video server. 

IBM’s two-tier clustered video server architecture is 
designed to support thousands of concurrent streams. IBM’s 
implementation enhances the performance of a centralized 
type of media server mainly. 


III]. DISTRIBUTED MEDIA SERVER MANAGEMENT SCHEME 


A. General 


The physical implications of the requirements of a high 
performance media server are just beginning to be 
understood. Very little work on using a parallel machine as 
a media server has been reported. Moreover, most of the 
existing scaling and distribution methods use a centralized 
manager, which knows the servers current load and content. 
In this paper we propose a management scheme for a 
distributed media server system. The difference of this 
scheme from the ones that already exist is that, the core 
network bandwidth consumption is reduced since users are 
served by the nearest server. Moreover, with the proposed 
method, if the manager or administration service fails to 
Operate on a media server, the whole system does not go 
down 


B. Description 


The proposed distributed media server system is based on a 
group of servers distributed in the network. These servers 


can belong either to a Local Area Network (LAN) or to a 
Wide Area Network (WAN). Users are accessing the on- 
demand services of such a system using commonly used 
computers with no special hardware or software. Using a 
fast network connection and optionally an MPEG-decoder 
card to improve to quality of the MPEG streams the user 
can request for a video, audio or other data from a standard 
World Wide Web (WWW) browser software. Once the 
video, audio or data stream (e.g. electronic books) is 
selected for viewing, it appears on user’s display in the 
video, audio or other playing application. The distribution 
of servers and the selection of appropriate server is invisible 
to users. 


C. Implementation 


The software components of the whole system can be 

distinguished into two major categories, client software and 

server software. The standard WWW browser as well as the 
necessary video, audio or other type of player belong to the 
former category while the standard WWW server and the 
media server software belong to the later. 

The WWW server provides an application gateway to the 

service. This gateway program is of course independent of 

the media servers and can be situated on the same or 
different machine than the media servers. All the 
knowledge about stream and server location is on the media 
server itself, thus making the system scalable. Media 
servers select the most appropriate server for the client and 
the selected server starts sending the content to the user. 

The media server consists of the following software 

components: 

e Media agent, which manages the overall media server 
functions and consists of 
e Media Request Broker, a thread or process that 

does the actual data transfer from disk via network 
to the client. The video, audio or other player 
controls the transfer-rate by signaling flow control 
messages. 

e Network Probe, that measures the number of 
routers between the server and client (hops) and 
the average packet delay between the server and 
the client. 

e SQL database, which is used to store and retrieve 
information concerning the requested streams, like 
transfer-rates, file locations on disks, information about 
the user, etc. 


The core algorithm which is the selection scheme of the 
appropriate media server consists of the following steps: 


User Request 
Phase 


The user requests the video, audio or 
data stream. 

2. The WWW server sends a message 
(FindServer) using multicast to a set of 
registered media servers in order to 
find the best media server available. 
The media servers belonging to the 
same multicast group receive the 
search message and initiate the 


Network 
Negotiation 
Phase 
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following queries: 
e Search the SQL database for the 
requested stream. 
Is there available disk bandwidth 
to serve the request exist? 
Does available concurrent data 
stream exist to serve the request 
exist? 
If the necessary stream or disk 
bandwidth is not found, it drops the 
request. 
Otherwise, proceeds to the 
measurement of hops and the average 
packet delay between the server and 
the user machine. 
Next the media servers send each 
other via multicast their network 
statistics for the client. 
After a predefined time period waiting 
for statistics they timeout and rank 
those statistics according to general 
rules that follow in descending order : 
i. less hops 
ii. less average packet delay 
iii. largest average disk bandwidth 
iv. less continuous data streams 
The media server that finds out to be 
on the top of the ranking, sends a 
consolidation to the WWW server. In 
case of two or more competing servers 
that claim to be the fastest, the first is 
selected only. The server that finds out 
to be second from the top of the 
ranking, sends a consolidation as a 
backup server to the WWW server. 
Playback Next the WWW server sends a 
Phase PlayData message to the media server 
selected as primary so that the media 
request broker starts the stream is sent 
to the user. The same time, the WWW 
server sends s PlayMirrorData 
message to the backup media server. 
While the selected media server serves 
the user, the backup media server 
probes the primary media server to 
ensure that it is working ok. In case 
the primary media server fails, it tales 
over and continuous serving the user 
after an acceptable glitch due to the 
swap. 

. After the client has finished viewing 
the stream, the media request broker 
ends and the reserved resources at the 
media server are freed. 


The following figure explains the selection algorithm 
described above. 


video, audio or data Application Gateway 


stream Send a FindServer 
¢ message to media 
i al servers via multicast 


User request for video, “WY Sever Send a PlayData 
audio or data stream. message to the best 


Sa media server. 

Running @ Send a PlayMirrorData 
www Browser message to the second 

* Windows, Unix 


best media server. 
Os 


Internet 
Backbone 


SQL 
Database 
Media Server 1 Media Server 2 


* Search SQL database * Search SQL database 
for stream for stream 
* Query for acceptable * Query for acceptable 
disk bandwidth and bandwidth. 
available streams 
4) Measure network performance and 4] Measure network performance and 
exchange statistics with other exchange statistics with other 
media servers via multicast media servers via multicast 
Collect statistics and if in top Collect statistics and if in top 
5 rank send a consolidation 5} rank send a consolidation 
message to WWW Server. If message to WWW Server. If 
second, send consolidation as second, send consolidation as 
backup. backup. 


If a PlayData is received, 
reserve resources and send 


If a PlayData is received, 
reserve resources and send 


stream to client. stream to client. 
7) If a PlayMirrorData is received, Q If a PlayMirrorData is received, 
reserve resources and probe reserve resources and probe 


continuously the primary server continuously the primary server 
if itis up and live. if it is up and live. 


Figure 1 


New servers can be added to the system relatively easy, the 
new server is added to the multicast group and the media 
agent must start. 


ITV. FUTURE WORK AND CONCLUSION 


The current implementation in this paper is a proposal of a 
distributed media server management scheme that provides 
a scalable solution for distributed media server 
architectures. 

The next step would be the implementation of a simulation 
environment where it would be possible to collect valuable 
statistical data about the performance of such a 
management scheme. 

Moreover, the implementation lacks the ability to 
dynamically manage the content between media servers. 

In this paper we have tried to propose a new schema for the 
management of a distributed media server system. 
Specifically, we described a scaleable system that directs 
user requests for video, audio or data streams dynamically 
to an appropriate server according to the user’s location and 
server’s bandwidth utilization. 
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Mathematical Approach to Designing Switched 

LAN’s. An Alternative Solution to Compute the 

Loss Probability in an Heterogeneous Traffic 
Environment 


Esteve Pallarés-Segarra ' Student Member, IEEE and Joan Garcia-Haro 2 Member, IEEE 


Abstract--With the integration of different multimedia 
services and the growing demand of bandwidth, Local 
Area Networks (LAN) are becoming switched networks. 
A rigorous and systematic design of switched LAN 
topologies requires, among other issues, to size up the 
capacity of the links connecting switches to appropriately 
support the new services. This paper proposes an 
alternative mathematical methodology to compute the 
loss probability obtaining more accurate results than 
other approaches in the related literature. The different 
services are modeled using different classes of ON-OFF 
sources and the capacity of the links have to guarantee a 
given information loss probability constraint. 


I. INTRODUCTION 


The deployment of new telematic services requiring 
high amounts of bandwidth is impacting on 
conventional telecommunication networks as well as 
on their resource dimensioning and _ planning. 
Traditional networks are not well suited to support this 
growing demand of bandwidth. This fact was first 
manifested in Local Area Networks and the easier 
solution was the increase of link capacity and their 
segmentation in several sub-LANs. Consequently, 
LANs that originally were shared-medium, shared- 
bandwidth ones are becoming switched networks. 

Classical topology design methods for switched LANs 
do not consider the integration of different and 
multimedia services in their traffic models to evaluate 
the capacity of the transmission links connecting 
switches. A solution was proposed in [1] considering 
the different services among final hosts to calculate the 
required capacity to support them. Each service is 
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modelled by a set of ON-OFF traffic sources [6]. The 
resulting traffic is the aggregation of R different 
classes of sources with @,; equal ON-OFF sources 
each one. Depending on the number of sources in the 
ON period of every class we have z different states, 
being z: 


R 
z=] [@, +1) (1) 


A finite buffer is used to characterise the access to the 
network, and its maximum buffering space represents 
the maximum delay allowed before the information 
enters to the network. The quality of service (QoS) 
requirement constraint is given by the information loss 
probability. Therefore, it is necessary to calculate the 
capacity of a pipe between the final hosts that 
guarantees the QoS requirements. 

In this fluid-flow model we can define the loss 
probability as the ratio between the average lost fluid 
and the average generated fluid. The average generated 
fluid can be evaluated as 


yz, F, (2) 
i= 


Being F, the probability the system is in the state i, 
and B; the fluid generated by the ON sources in this 
state. 


R 
nl{O) mar 
R 
B= ys i; (4) 


Where: 
— 5, 1s the number of sources of class j in the ON 


state. It depends on the system state i. 
ipl is is the probability that a source of class j is in 


the ON state. 
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—  q;: is the probability that a source of class j is in 


the OFF state. 
-  i;: is the fluid intensity generated by a source of 


class j in the ON state. 


The main computational difficulty is in the evaluation 
of the average lost fluid. It can be calculated as 


Being: 

— c: the capacity of the pipe. 

— mz: the buffer size. 

- F. (x): the prob(state i and the content of the 
buffer > x). 


If we define F;(x) as the probability the system is in 
state i and the content of the buffer is lesser or equal 
than x we can calculate F,(x) since the sum of F;(x) 
plus F(x) equals to F.. From [2], it is known that 
F;(x) satisfies the system of differential equations 
given by (6), 


De ire): O<x<m (6) 


where D is the diag(B, —c,B,—c,...,B,-—c) and M 


is the infinitesimal generator of a continuous-time 
Markov process. 


rj denotes the transition rate from state i to state j. 
If the arrival process is finite-dimensional and time- 


reversible, then D~'Mis known to have a basis of 
eigenvectors and the solution to (6) can be written as 


(8): 
fell a 
F(x)= a,c", (8) 
j=0 


Being @, the right eigenvector corresponding to the 
eigenvalue A ;- The determination of the a j elements 
depends on boundary conditions: 

If B;<c the buffer never fills therefore F;(m) = F, ; 

If B; >c the buffer never empties and thus, F,(0) =0. 


The determination of the a j coefficients becomes a 


complex computational task [2]. Therefore, in this 
paper one of our main contributions is the proposal of 
an alternative mathematical methodology to obtain the 
average lost fluid using some of the properties of D 
and M matrices that notably simplifies the problem 
and additionally provides more accurate results than 
other analytical solutions. 


II. MATHEMATICAL APPROACH PROPOSED 


Since the matrix M is the infinitesimal generator of a 
continuous-time Markov process, the sum of its 
columns is always zero. Its determinant equals zero 


and the determinant of D™'M too. This implies that 
the characteristic polynomial has the zero root, being 
A=0 one of the z eigenvalues. It is known that in a 


continuous-time Markov process MF = 0, where F is 
the vector of state probabilities. Therefore, F isa right 
eigenvector associated with the zero eigenvalue for the 
matrix M as well as the matrix D~'M. Taking into 


account these considerations we can derive the 
following expression: 


z-l 
F(x)=aF + Yaje"®, (9) 
jal 


Being aoF the term associated to the zero eigenvalue. 


On the other hand, if we construct a vector where its 
components are the elements of the diagonal of D, it 
can be easily demonstrated that it is a left eigenvector, 


associated to the zero eigenvalue of the D™'M matrix. 
Then, if we compute the escalar product of a left 
eigenvector with a right’ eigenvector the result is O if 
they are associated to different eigenvalues [5], 
therefore 


4 


YB; — cF; (x)= ay > (B; ais 
apo h i=] . (10) 

a je" S'(B; - 0) =49 > (B, - oF, 
j i=1 i=l 


Jal 


Following this, the result achieved is independent of x, 
and using this expression to calculate the average lost 
fluid we obtain: 


average lost fluid = 


= VIB, - Fi (m)= 0-4.) (8, - oF, (11) 
i=l =) 


Finally, the loss probability is given by (12). 
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loss probability = (a —1) 
i=l 


Considering this result the computational complexity 


to determine all the a j values has been reduced to 


obtain only the coefficient associated to the zero 
eigenvalue and more accurate results can be obtained. 

In addition, the states can be sorted according to the 
generated fluid, being the first state the one with all the 
sources in the OFF state (no fluid generation) and the 
last state, the state with all the sources in the ON 
position (maximum fluid generation). This is the order 
of the states used in the homogeneous case [3]. Then, 
we can define the T state as the greatest state for which 
the fluid generated is lesser than c, therefore the 


boundary conditions can be stated as F;(m)= F. if 
is<T and F,(0)=0 if i>T7. The system of linear 
equations is now expressed as: 


zl 
A ; 
af; +) aje a i= iS<T 
jal 
z-l 


jal 


(13) 


In this case, it is possible to reach a closed expression 
for the ag coefficient using the Cramer’s rule. 


ao = 7 : ( 14) 
Ycv' F, - det (minor(F.)) 


i=T+1 
T 


Yc 1)! F, -det (minor(F,)) 


i=l 


1+ 


The disadvantage of the expression we achieved (14) 
is that it is necessary to calculate all the eigenvalues 
and all the eigenvectors of the D™'M matrix to solve 
the problem. 

Obviously we are only interested in pipe capacities 
greater than the average flow generated, that is, 


v4 
c> > BF, (15) 
i=l 


Under this circumstance and considering (12) the 


value of ag is bounded between the margins given by 
(16). 
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Se (16) 


Using this expression we can observe that the ag 
coefficient tends to 1 when c increases. Unfortunately, 
this fact also increases the computational complexity, 
because we cannot take advantage of the floating point 
numerical representation since the mantissa always has 
the value one in its integer part. However, it is possible 
to solve this problem arranging appropriately the 


system of linear equations (13) to obtain the a, 


values. If we subtract the value F, in both members of 
the ith equation of the linear system the a) unknown 
quantity becomes the a)-—1 unknown quantity. 
Applying again the Cramer’s rule we obtain: 
ay -1=——- = (16) 
yp! F, - det (minor(F,)) 


i=l 


Sp F, - det (minor(F, )) 


i=T+1 


hae 


As we see in the formula (12) this is the quantity that 
we are looking for at all. Therefore, with this new 
arrangement we can achieve more accurate results than 
other traditional methodologies of computing the loss 
probability. 


III. RESULTS 


We have computed the loss probability for different 
mixes of traffic using both, the classical [1] and our 
improved method. The values of f,,, tog and fluid 


intensity for each class of traffic are taken from 
reference [4], and they are reproduced in table 1. 


eee | oa 
Video 

2 fetaton| M0 || 

| 3 | 30 | 2 | 
File 
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Table 1. Classes of traffic used in the different mixes. 


Color fax 


As we stated before, computing the loss probability 
with the improved method gives more accurate results, 
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without appearing underflow problems. Figure 1 
depicts the loss probability with the improved method 
for a mix of two sources of class 2 and two sources of 
class 3. It compares the results for three buffer sizes of 
50, 100 and 200 respectively. 


CAPACITY 
10 


———- 


LOSS PROBABILITY 


Se a 

Fig. 1. Mix with two sources of class 2 and two 
sources of class 3 for several values of the 
buffer size m. 


We can observe that services requiring higher amount 
of bandwidth behave, or produce a mask effect over 
other services and the loss probability decreases more 
quickly when the capacity value of the pipe 
approaches the maximum intensity of the service 
under consideration. In figure 1 the video telephony 
service that needs 10 Mbps masks the color fax service 
of 2 Mbps. 


CAPACITY 
21 21.5 22 22.5 23 23.5 24 


1.0E-01 


1.0E-10 


fe loss probability 2 -~-loss probability 1 ] 


Fig. 2. Underflow problems. Loss probability 1 is 
computed by using the classical method and 
loss probability 2 is calculated with our 
improved method. 


Figure 2 compares both methods for the same mix of 
figure 1 and a buffer size of m=100. When the capacity 
reaches the value of 19.9 Mbps underflow problems 
arise whit the classical methodology but very accurate 
results are obtained with our improved methodology. 


IV. CONCLUDING REMARKS 


A mathematical approach to calculate the loss 
probability in a buffer using a fluid-flow model has 
been proposed. The approach is based on _ the 
properties of the matrices D and M reducing the 
calculation of z coefficients to the calculation of only 
one coefficient, that is, the ag coefficient. 


A closed expression for the ag coefficient has been 


also derived. A further proposal to arrange the system 
of linear equations to solve dg allows to take more 


advantage of the floating point numerical 
representation obtaining much more accurate results. 
The classical methodology has been compared with 
the proposed one that highly improves the accuracy of 
the results. Some results with different traffic mixes 
have been also shown to support our claims, 
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Abstract-Software agent technology is a new and an 
emerging technology. It is an active research area today. 
Generally, a software agent is defined as an autonomous 
software entity that can interact with its environment. Mobile 
agents represent a special class of agents capable to transport 
itself from one environment or platform to another. Agents of 
this type can be successfully used to assist in complex 
telecommunications network and system management tasks 
such as: operation monitoring, load balancing, rebooting, 
failure anticipation, problem analysis, etc. Moreover, mobile 
agents allow to decrease pressure on centralized network 
management systems by delegation specific management 
activities from the central operations system to spatially 
dispersed management agents. 


The expected benefits represent a motive to better 
understand how to develop applications using agent 
technology. Information modeling is one of the most 
important phase in such development. It concerns with 
identification and location of information in the systems and 
the description of information processing activities, but 
completely independent of the system implementation. In this 
paper we explain our approach to information modeling of 
management applications using mobile agents. The approach 
is based on the usage of two modeling languages: Unified 
Modeling Language (UML) with adequate extensions as well 
as Object Constraint Language (OCL). 


Index Terms— information modeling, mobile agents, tele- 
communications management, UML, OCL 


I. INTRODUCTION 


Traditional network management scheme was centralized. 
One host in the configuration has the role of a network 
management station. The remainder of the devices on the 
network contains agent software and information base to 
allow monitoring and the control from the management 
station. Too much burden is placed on single management 


station, and there is too much management traffic to be 
carried from every single agent to management center and 
back. 


Decentralized, distributed management approach works 
better in the circumstances of large managed systems, e.g. 
telecommunication networks. Various management roles 
are dispersed through multiple management stations. This 
type of architecture spreads the processing burden and 
reduces total network traffic [2]. 


Today a number of new and powerful distributed 
processing technologies exist like CORBA (standardized 
middleware for distributed object-oriented applications), 
JAVA (platform independent object-oriented programming 
language), MAF (emerging standard for mobile agents 
system) [5]. These new technologies are inspired searching 
for new management system architectures (e.g. CORBA- 
based management systems [6]) as well as developing new 
techniques for modeling and development of the systems. 
The most important characteristics of these new 
middleware-centric management systems will be uniform 
applications of object-oriented modeling and programming 
paradigm to both management and managed part of the 
systems (accordingly to the Telecommunications 
Information Networking Architecture - TINA), [7]. 
Furthermore, approach to information modeling of the 
middleware-centric systems need to be implementation and 
protocol neutral. 


Information modeling strategy together with information 
models of an application using mobile management agents 
with dynamically extensible behavior are described in the 
next section. Following modeling aspects are emphasized: 
structure and static binding, dynamic binding and 
migration. Information specifications are translated and 
prototypical implemented on ObjectSpace's Voyager agent 
platform [4] in Java. Results are shortly described in the 
last section. 
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II. INFORMATION MODELING STRATEGY 


The mobile agents technology is a new technology. Mobile 
agent systems responsible for handling agents developed by 
various software companies differ widely in architecture 
and implementation, preventing interoperability. Our 
research in this turbulent area of distributed application 
with mobile agents has been directed in accordance with 
two basic principles: 


e Conceptual model of agent system is the model defined 
by OMG in [5]. It is object-oriented and independent of 
any available implementation architecture. 

e Conceptual space of the mobile agents with dynamic 
aggregation and reclassification abilities is specially 
focused on. The dynamic abilities is expected to be 
intensively applied in telecommunications management 
software systems. 


Therefore, mobile agents should be designed with main 
assumption that they can never exactly know what types of 
instances representing resources will be present at certain 
subsystem (on their itinerary) on which management tasks 
should be performed. To be useful in such and other 
management situations our agents expose ability to 
dynamically extend their behavior. 


Modeling strategy will be explained on an example with 
following scenario: an agent is used by many subsystems, 
each of which requires the agent to be adapted 
(customized) in a subsystem-specific way (Fig 1). Such 
scenario imposes many important modeling problems 
which also exist in more sophisticated scenarios (e.g. 
where adaptation is made conditionally, depending on the 
state of destination subsystem). 


Mobile Agent Specialized Part1 
@@\-—"T . Managed Subsystem MS1 
SS F location L1 


Manager 


Manager Subsystem ia a (22 
location M - 


Mobile Agent Specialized Part n 
Managed Subsystem MSn 
location Ln 


Figure 1: Scenario for the mobile management agent 
usage. 


Manager object (Fig. 1) creates and sends a Mobile 
Agent object on itinerary to specified location i={/,..., n}. 
Subsystem-specific object Specialized Part i is 
dynamically attached to the Mobile Agent object on its 
arrival to the location. The Mobile Agent object's 
behavior is customized i.e. adapted to management 
operations needed in managed subsystem on the location i. 


The Mobile Agent object and Specialized Part i 
object form a dynamically created aggregation. During its 
staying at Manager Subsystem, Mobile Agent object is in 
mobile agent role and presents IMobileAgent 
interface to Manager object. At location i, the Mobile 
Agent is in the role of base agent. After the 
aggregation is established, the Mobile Agent is in the 
role of adapted agent and presents IAdaptedAgent 
interface. 


Three modeling aspects are particularly considered and are 
described in detail in the following subsections. These 
aspects are: structure and static binding, dynamic binding 
and object migration. 


A. Structure and Static Binding 


An UML class diagram (Fig. 2 and 3) is useful for 
modeling structure, relationships, and the static binding of 
an abstraction (i.e. abstract type) to its interface. The main 
modeling construct in presented class diagrams - 
"stereotyped class", really represents a type of information 
bearing entity, not a class. The entity is not a software 
object but an abstraction identified during information 
modeling. This is the reason while we use class symbol, 
stereotyped as type, in diagrams on Fig. 2 and 3. Later, in 
the design and implementation phase of application 
development, these entities will be implemented more or 
less directly as software objects. Consequently, objects 
which participate in interactions shown in Fig 4 are only 
indirect (UML-term prototypical) instances of the abstract 


types. 


<< type>> 
Manager 


Responsibility 
association reas -- ready to be moved to 

another location 

<< interface>> 

iMobileAgent 

+goToLocation(Location) 
Figure 2: Static interface binding - mobile agent role 
(UML class diagram). 


When an abstract type like Mobile Agent participates in 
an association, it has a specific role. A role is just the face 
an abstraction presents to the abstraction at the other end 
of the association. The role is labeled as mobile agent. 
In the role, Mobile Agent object presents only the 
IMobileInterface to the Manager object. Only the 
properties specified by the interface are visible and relevant 
to the Manager. Fig. 2 shows that IMobileInterface 
definition includes single operation with signature: 
goToLocation (Location). The syntax: 
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role name:interface_ name 


explicitly shows the type of the role. With this explicit 
type, the role the Mobile Agent plays is very precisely 
defined by the interface definition. Operations and its 
Signatures are identified in accordance with the 
responsibility a type has in a role. In the specialize or 
manage association, the Mobile Agent presents an entirely 
different face to types on the other part (Fig. 3). 


1 specialize > 1 << type>> 
base agent : Mobile Agent 
\BaseAgent 


<< interface>> 
IBaseAgent 
setAggregation(SpecializedPart) 


<< type>> 


Managed Resource ss: 1 
= manage 
cVar1 : Numeric << type>> 
cVar2 : Numeric adapted agent :|| Mobile Agent 
\AdaptedAgent 


managementOper, 


<< interface>> 
lAdaptedAgent << type>> 
Specilaized 


connectResources(Resources) Part 


Figure 3: base agent and-adapted agent role of the 
Mobile Agent type. 


B. Behavior, Dynamic Binding and Migration 


Generally, a mobile agent, to be autonomous entity, 
contains all necessary control information (i.e. the service 
logic of a particular management service). To model this 
part of Mobile Agent type it is possible to choose among 
informal to more formal approaches. Responsibility of a 
type can be specified as a text in an extra compartment in 
the stereotyped class icon (see Fig. 2.). Dynamic view of a 
type can be expressed by UML statechart diagram. 
Dynamic view includes behavior specification i.e. the 
sequence of states object of a type goes through in response 
to events, together with its responses to those events. 
Constraints that must be true when an operation is invoked 
(precondition) and at the completion of an operation 
(postcondition) can be specified in Object Constraint 
Language (OCL) [3] (see Fig. 4). 


For example, following simple rule can be stated: 
"Operation managementOper () will be done by an object 
of Managed Resource type only if values of its two 
control variables cVar1 i cVar2 are different, else the 
operation will be rejected. After the operation is finished, 
values of both control variables are equal". Fig. 4 shows 
this rule expressed in pre/post condition specification in 


OCL. OCL can be used for expressing semantics of new 
(stereotyped) modeling elements. Syntax and semantic 
definition of some elements useful for management 
applications modeling are described in [8]. 


-- expression context 


ManagedResource: :managementOper () 


--pre/post cond. for managementOper () 
pre : self.cVarl <> self.cVar2 
post : self.cVarl = self.cVar2 


Figure 4: Operation pre/post condition specification (OCL) 


Besides the UML and OCL, a more formal notation, based 
on set theory and first-order logic [7], can also be used for 
system structure specifications and, for constraints 
specifications (integrity, behavior rules, invariants, policies 
etc.). Such formal notation enables inference about 
models, and proving their correctness. 


For modeling the dynamic binding of an abstraction to its 
interface we use the <<become>> stereotype in an UML 
interaction diagram (sequence or collaboration diagram). 
Dynamic binding means changing an object from one role 
to another (see. Fig. 5). In the fragment of collaboration 
diagram shown on Fig. 5, the Mobile Agent object 
changes its role twice. First it becomes a base agent, 
and than an adapted agent. The <<become>> 
stereotypes specifies that the target is the same object as the 
source (always object a: MobileAgent) but at a later point 
in time and with different values, state or roles. 
<<become>> is a standard UML stereotype which is 
applied to message concept [1]. 


a: Mobile Agent 
O [mobile agent] 
{location = Manager 

Subsystem} 


m_: Manager 
{location = Manager 


Subsystem} 1: <<create>> 


2 : goToLoacation(Location) 


location IN 


tagged value 2.41: <cbecome>>J 


Rost a: Mobile Agent 
esd [base agent] 
{location = Managed (ate, 
Subsystem } {location = Managed 
Subsystem } 


3 : setAggregation(S.Part) 


3.1: <<become>>J 


4: connectResources(Resources) 


a: Mobile Agent 
[adapted agent] 
{location = Managed 
Subsystem }} 


t: Managed Resources 
{location = Managed 
Subsystem i} < 
5 : managementOper() 


Figure 5: Dynamic binding aspects (UML collaboration 
diagram) 


Apart from the problem of dynamic binding there is a 
problem of modeling migration of objects. To model the 
migration we render the location of an object explicitly as a 
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tagged value. New allocation of an object is rendered by a 
message stereotyped as become. 


III. PROTOTYPICAL IMPLEMENTATION 


The Object Space's Voyager implements an agent- 
enhanced ORB. Voyager agent system API offers a lot of 
public Java classes and interfaces which enable efficient 
mobile agent applications coding. This is the main reason 
we used this programming system for prototypical 
implementation as well as for testing and performance 
evaluation. Programming snippet on Fig. 6 shows some 
elements of the previously described information model 
translated into the Voyager programming system. 


package voyprog.manager; 
public interface IMobileAgent { 
void goToLocation(String location) ; 


} 


import com.objectspace.voyager.agent.*; 
import com.objectspace.lib.facets.*; 
public class MobileAgent implements 
IMobileAgent, Serializable 
{ 
String location; 
voyprog.managed.ISpecializedPart sPart; 


public MobileAgent() { 
System. out.println("MobileAgent ..."); 
} 
public void goToLocation(String locat) { 
try { 
this.location = new String(locat) ; 
Agent.of (this) .moveTo (location, 
"atManagedSubsystem") ; 
}catch(Exception e) {;} 
) 
// on arriving at new location 
public void atManagedSubsystem() { 
System.out.println("Arrived at "+locat); 
IFacets parts = Facets.of (this); 
sPart = t 
(voyprog.managed.ISpecializedPart) 
parts.get ( 
"voyprog.managed.ISpecializedPart") ; 
if(sPart != null) { 
sPart.gManagedRes () .managementOper () ; 
}Jelse { 
System. out.println(sPart) ;} 
Agent.of (this) .setAutonomous (false) ; 


public class Manager { 
voyprog.manager.IMobileAgent a; 
public Manager() { 
System.out.println("Manager created") ; 
try { 
voyprog.manager.IMobileAgent a 
=(voyprog.manager.IMobileAgent) 
Factory. create ( 
"voyprog.manager.MobileAgent") ; 


a.goToLocation ("//localhost: 7001") ; 
}catch(Exception e) {;} 


} 
) 


Figure 6: Java Programming snippet: the Manager side 


IV. CONCLUSION 


The motivation for the modeling work described in this 
paper is the need for better understanding how to model 
and develop applications using mobile agent technology. 
Autonomy and mobility are probably the most challenging 
properties that provide telecommunications management 
agents with the potential to influence the traditional way of 
communications and service realization. On the same time, 
they open new software modeling problems that must be 
resolved. These two important questions are opened: what 
kinds of modeling language are required and what 
underlying meta-models are required to support agent 
based applications specification and development? 


Information specifications (models) should be produced 
early in the applications development process. These 
specifications should be conceptual i.e. implementation 
technology independent. In this paper we propose how to 
use UML as a notation system [1], [9] for expressing 
information models. Besides standard language constructs, 
UML offer useful capabilities (stereotypes and tagged 
values) for meta-model extension [1], [8], [9]. By using 
these capabilities UML can be tailored to specific needs of 
modeling applications with mobile agents. 


V. REFERENCES 


{1} Booch G., Rumbaugh J., Jacobson I., The Unified Modeling 
Language User Guide, Addison-Wesley Longman, Inc. 1999. 

[2] Kunstic M, Pogacnik F, Sandri N, "A Global Model for Solving the 
Management Problems in Telecommunication System", Conf. on 
Communication Networks and Distributed Systems Modeling and 
Simulation (CNDS'97), Proceedings of the Conf, pp.17-22, 
Phoenix, 1997. 

[3] Object Constraint Language Specification, UML v.1.1, Rational 
Softw. Co., 1997. 

[4] ObjectSpace Voyager, Core Technology 2.0, User Guide, 1999. 

[5] OMG TC Document cf/97-06-04, Crystaliz Inc., General Magic 
Inc, GMD FOKUS, International Business Machines Co., Joint 
Submission, Mobile Agent Facility Specification, June 1997. 

[6] OMG Telecom Task Force, CORBA-based Telecommunications 
Network Management System, OMG White Paper, May 1996. 

[7] Randic M, The Formal Model of Distributed Object-Oriented 
Software for Telecommunications System Management, Ph 
Dissertation (in Croatian lang.), ZZT, FER, University of Zagreb, 
1998. 

[8] Randic M, Kunstic M, Blaskovic B, "Extensions to UML for ORB- 
centric software modeling: the group and policy stereotypes", 
SoftCOM'99 Conf. Proceedings, pp.173-180 , Split-Rijeka-Trieste- 
Venice 1999. 

[9] Unified Modeling Language v. 1.1, Semantics, Rational Softw. Co., 
1997. 


a a RE | Say SETS ee ee ne Pe nN 
18 10" Mediterranean Electrotechnical Conference, MEleCon 2000, Vol. I 


Fuzzy RED: Congestion control for TCP/IP Diff-Serv 
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Abstract-Network congestion control remains a 
critical issue and a high priority. The explosion of 
Internet made the use of effective congestion control 
algorithms a necessity. The fact that Internet relies 
mainly on the TCP/IP protocol suggests that we must 
focus our efforts in this area. The existing TCP/IP 
protocol was designed for best effort services and can 
not support efficiently new user demands for 
integrated services. Currently we see new proposed 
Internet architectures to deal with integrated services 
networks (e.g. Int-Serv, Diff-Serv). The design of 
effective congestion control strategies for integrated 
services networks is known to be difficult because of 
their structural complexity, the nature of services 
supported, and the variety of dynamic parameters 
involved. As a result a number of researchers are now 
looking at alternative non-analytical control system 
design and modeling schemes that have the ability to 
cope with these difficulties in order to devise effective, 
robust congestion control techniques as an alternative 
(or supplement) to traditional control approaches. 

In this paper we study congestion control issues in 
TCP/IP Diff-Serv and present a novel congestion 
algorithm that uses fuzzy logic based control theory. 


Index Terms--TCP/IP, congestion control, Diff-Serv, 
Fuzzy Logic 


I. Introduction to TCP/IP congestion control 


As Internet growth increases it becomes clear 
that existing congestion control solutions 
deployed in the Internet Transport Control 
Protocol (TCP) [',"] are increasingly becoming 
ineffective. It is also generally accepted that 
these solutions cannot easily scale up even 
with various proposed “fixes” ["",'”]. Also, it is 
worth pointing out that the User Datagram 
Protocol (UDP), the other transport service 
offered by IP Internet, offers no congestion 
control. However more and more demanding 
users use this for the delivery of real time 
video and voice services. The newly 
developed (also largely ad-hock) strategies 


_ feedback, 


[’,"] are also not proven to be robust and 
effective. Since these schemes are designed 
with significant non-linearities (e.g. two- 
phase—slow start and congestion 
avoidance—dynamic windows, binary 
additive-increase multiplicative- 
decrease flow control etc) based mostly on 
intuition, the analysis of their closed loop 
behavior is difficult if at all possible, even for 
single control loop networks. Even worse the 
interaction of additional non-linear feedback 
loops can produce unexpected and erratic 
behavior i: Empirical evidence 
demonstrates the poor performance and cyclic 
behavior of the controlled TCP/IP Internet 
[“"] (also confirmed analytically [""]). This is 
exacerbated as the link speed increases to 
satisfy demand (hence the bandwidth-delay 
product, and thus feedback delay, increases), 
and also as the demand on the network for 
better quality of service increases. Note that 
for WAN networks a multifractal behavior 
has been observed ["], and it is suggested that 
this behavior —cascade effect—may be 
related to existing network controls [”"]. Based 
on all these facts it is becoming clear that new 
approaches for congestion control must be 
investigated. 


II. New Internet Architectures: Diff-Serv with 
RED 


A. Diff-Serv 


Since Int-Serv failed, the IETF proposed a 
more evolutionary approach that did not 
require significant changes to the Internet 
infrastructure and provided differentiation of 
services (Diff-Serv) [*"]. To accomplish this 
Diff-Serv uses the ToS bits in the IP header, 
which are now renamed as "DS field" [*"]. 


' Department of Computer Science, University of Cyprus 
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The functions associated With these bits have 
been also redefined. The main issue of the 
Diff-Serv approach is how to standardise a 
simple set of mechanisms for handling 
packets with different priorities set by the DS 
field in the IP header. In Figure 2.1 we can 
see the basic Diff-Serv architecture approach. 
Note that Diff-Serv works only at the edges of 
the network. This means that the priorities are 
set at the edges of the network, which reduces 
the complexity and makes it more scalable. 


Figure 2.1 Diff-Serv Architecture 


On the other hand nothing is done to assure 
that the priorities will actually mean 
something when the packet enters the Internet 
(leaves the edge router). So we can say that 
Diff-Serv provides only a very rudimentary 
QoS, without any quantified guaranties (as is 
the case in ATM). Because of the limited 
number of bits in the DS field, the Diff-Serv 
working group has defined a small set of 
building blocks, called per-hop behaviours 
(PHBs) which are used by the routers to 
deliver a number of services. They are 
encoded in the DS field and they specify the 
forward behavior each packet will expect to 
receive by the individual routers in the 
Internet. When this will be used on an end-to- 
end basis it is envisioned that it will offer 
support to a number of emerging applications. 
The two PHBs being standardised are the 
Expedited Forwarding (EF) [*"], and the 


Assured Forwarding [**]. The EF PHB 
specifies a forwarding behavior in which 
packets see a very small amount of loss and a 
very low queuing delay. In order to ensure 
that every packet marked with EF receives 
this service, EF requires from every router to 
allocate enough forwarding resources so that 
the rate of incoming EF packets is always less 
than or equal to the rate at which the router 
can forward them. This is done through a 
Service Level Agreement (SLA) during the 
connection setup. In order to preserve this 
property on an end-to-end basis, EF requires 
traffic shaping and reshaping in the network. 
Although there is no specific method set for 
this, it will most probably be a leaky bucket 
buffering algorithm. The AF PHB group 
specifies a forwarding behavior in which 
packets see a very small amount of loss. The 
AF PHB group consists of four independently 
forwarded classes. Within each class, two or 
three drop preference levels are used to 
differentiate flows in the class. The idea 
behind AF is to preferentially drop best-effort 
packets and non-contract conforming packets 
when there is congestion. By limiting the 
amount of AF traffic in the network and by 
managing the best-effort traffic appropriately, 
routers can then ensure low loss behavior to 
packets marked with the EF PHB. As we can 
see Diff-Serv will try to provide some QoS 
using a drop-preference algorithm when 
congestion occurs. 


B. RED 


The most popular algorithm used for Diff- 
Serv implementation is RED (Random Early 
Discard) [*’]. RED simply sets some min and 
max dropping thresholds for each class. In 
case the buffer queue size exceeds the min 
threshold, RED starts dropping randomly 
packets based on a probability which depends 
on queue length. If the buffer queue size 
exceeds the max threshold then every packet 
is dropped, i.e. drop probability is set to 1. 
The RED implementation for Diff-Serv 
defines that we have different thresholds for 
each class. Best effort packets have the lowest 
min and max threshold and therefore they are 
dropped with greater probability than packets 


of AF or EF class. Also there is the option that 
if an AF class packet does not comply with 
the rate specified it will be remapped to best- 
effort class packet. Apart from RED many 
other mechanisms such as n-RED, adaptive 
RED, BLUE [*“"] and Three color marking 
were proposed for Diff-Serv queue control. 


In Figure 2.2 we can see a simple Diff-Serv 
scenario where RED is used for queue 
control. 


Bi Oars WY pes rat 


Figure 2.2 Diff-Serv scenario with RED 
queue for control 


A leaky bucket traffic shaper is used to check 
if the packets comply with the SLA. If EF 
packets do not comply with the SLA then they 
are dropped. For AF class packets, if they do 
not comply then they are remapped into Best 
Effort Class packets. Both AF and Best effort 
packets are sharing a RIO Queue. RIO stands 
for RED In/Out queue, where In and Out 
means packets are In or Out of connection 
conformance agreement. For AF and Best 
Effort class we have different min and max 
thresholds. EF packets are using a separate 
high priority FIFO queue. 


III Fuzzy RED 


In this paper we propose a novel approach to 
the RED, Diff-Serv implementation. We 
propose Fuzzy RED, a fuzzy logic controlled 
RED queue. To implement it, we removed the 
fixed max, min queue thresholds from the 
RED queue for each class and replaced them 


with dynamic network state dependant 
thresholds calculated using a fuzzy logic 
controller (FLC). As was reported in [xvii] 
classical RED implementations with fixed 
thresholds can’t provide good results when we 
have dynamic network state changes, e.g. in 
the number of active sources. The FLC 
calculates the drop probability behavior based 
on two network queue state inputs: the actual 
queue size and the queue rate of change. In 
implementation we add an FLC for each Diff- 
Serv class of service. The FLC for each class 
uses separate rules and sets for calculating the 
drop probability based on the input from the 
queues. Since the 2 input FLC can offer better 
ability to linguistically describe the system 
dynamics we expect that we can better tune 
the system and the behavior of the RED queue 
according to our class of service policy. The 
dynamic way of calculating the drop 
probability by the FLC comes from the fact 
that according to the rate of change of the 
queues, the current buffer size and the class 
the packet belongs, a different set of fuzzy 
rules applies. Based on these rules the drop 
probability is calculated. Since the whole 
procedure is independent of the number of 
active sources we avoid the problems of fixed 
thresholds mentioned by other RED schemes 
[xvii]. With fuzzy RED not only we expect to 
avoid such situations but also to generally 
provide better congestion control and better 
utilization of the network. 


Below we present example of fuzzy rules and 
sets 

Value sets for Premium category queue 
lingvar q on [0,90] initially 0.0 with lingvalue 
empty: 1s" 0" Or 18°35 
lingvalue moderate is 20 33 42 63 
lingvalue full is 44 64 90 90 


/* rate of change of queue length - common 
for all categories */ 

lingvar dq on [-44,44] initially 0.0 with 
lingvalue decreasing is -44 -44 -7 -1 
lingvalue zeros is -14 O O 14 

lingvalue increasingis 1 7 44 44 


/* output variables: */ 
/* drop rate probability */ 
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lingvar pr_premium on [0.0,1.0] initially 0.0 
with 

lingvalue zero 
lingvalue low 
lingvalue medium 
lingvalue high 


is -20.0 -10.00.0 0.0 
is 0.0 0.10 0.10 0.20 

is 0.15 0.20 0.20 0.30 
is 0.20 0.30 0.30 1.00 


/* linguistic rules for premium class*/ 

if q is empty then pr_premium is zero; 

if q is moderate and dq is decreasing then 
pr_premium is zero; 

if q is moderate and dq is zero 
pr_premium is zero; 

if q is moderate and dq is increasing then 
pr_premium is low; 


then 


if q is full and dq is decreasing then 
pr_premium is low; 

if q is full and dq is zero then 
pr_premium is medium; 

if q is full and dq is increasing then 


pr_premium is high; 
V Conclusions 


Current TCP/IP congestion control algorithms 
cannot efficiently support new and emerging 
services needed by the Internet community. 
Diff-Serv and RED propose a solution. Our 
proposal in implementing a fuzzy logic 
controlled RED queue is a novel, more 
effective, robust and scalable approach. Based 
on our past experience with successful 
implementations of fuzzy logic control [*”""] 
we are very optimistic that this proposal will 
offer improved results. Further work and 
investigation is currently underway. 
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Data Security on Ethernet LANs 


Nikola Hadjina and Philip Thompson 


Abstract—A software independent solution for Ethernet LAN 
security is given by introducing a hardware device between 
protected host and network. The device gives the host access 
only to data addressed to it, without interfering with 
communication between unprotected hosts, which operate 
independently of the protected system. The purpose of this 
paper is to describe the thinking behind the design of a 
practical implementation of the above mentioned device. This 
will include the choice and design of the encryption system and 
the key management system. The paper will first discuss the 
design requirements and then describe how they were 
implemented. A detailed description of the encryption 
algorithm, simulated one time pad key system with distributed 
management is also given. Finally, there will be a discussion of 
the performance of tested systems. 


Index Terms—Encryption, key distribution, key management, 
one-time pad, network security 


I. INTRODUCTION 


Local Area Networks,.such as Ethemet, are broadcast 
systems. This results in data, communicated over the 
network, being available to all users, unless the data are 
suitably encrypted. If it is required that users are able to 
access only the data that is directed to them, key 
management systems are required. Such key management 
systems require layers of software that can interfere with 
the software used for data management. Thus, they can 
significantly degrade the usefulness of the network. 
Furthermore, if “hackers” are able to gain access to 
software, security will be reduced. Dr. E. Stewart Lee [1] 
suggested that these problems could be solved by 
introducing a hardware device between each host computer 
and the network, which would prevent any host from 
reading data not specifically addressed to it. See Fig. 1. 
This device would appear to both the host and the network 
as a software independent repeater and its operation would 
not be accessible to unauthorized operators. Without it, the 
host would not be able to communicate with the protected 
systems and would not have access to the protected data. 


II. THE SECURITY DEVICE AS PROPOSED 


A system of hosts can be made secure if information can 
be received and understood only by the intended recipient 
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and cannot be received, or tampered with, by others. The 
proposal is that, where data security is required, it is 
transmitted over the network only under encryption and that 
it is decrypted only for the intended recipient. The proposal 
telates to local area networks and “Ethemet” type systems 
in particular. However, the ideas are general and may be 
applied to other networks. 


SEGMENT OF LOCAL AREA NETWORK 
(Protected Data - Encrypted & Unprotected Data - Clear Text) 


TAP 4 TAP 2 TAP 3 


CLEAR TEXT CLEAR TEXT 
FOR PERMITTED FOR PERMITTED 
DATA ONLY DATA ONLY 


Figure 1: The Connection of Protected Hosts ata LAN 


In the Etheret example, a security device is connected 
between the network and “protected” host. It is assumed 
that the network will carry both protected and unprotected 
data and that the data transfer between hosts will be 
controlled by any one of the currently available software 
packages. An unprotected host, that is a host directly 
connected to the network, will be able to view all 
unprotected data broadcast over the network. Protected data 
will be encrypted and not available to an unprotected host. 
All, or any part, of the data packet may be encrypted 
according to the requirements of the security policy in 
force. Thus, if the security policy requires that the addresses 
be encrypted, an unprotected host will be unable to 
determine either the source or the intended recipient. In all 
other ways, a protected packet will meet the same network 
requirement as a normal unprotected packet. A protected 
host will be connected to the network through a security 
device, which contains the host’s address. This will repeat, 
to the host, the decrypted version of any encrypted packet 
addressed to it. All others packets, whether encrypted or 
plain text, will appear as similar information free “filler” 
packets. Any packet, transmitted by the host, will be 
encrypted and delivered to the network, providing the 
originator’s address matches the host address. The result of 
this strategy would be not only that hosts would be 
prevented from viewing information not addressed to them, 
but also that the source of any information they transmitted 
would be properly identified. 
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The removal of the responsibility for encryption and 
decryption from a host removes the need for a separate 
encryption key for each host to host data exchange, because 
no host is privy to the key. However, the key should be 
changed at appropriate intervals to reduce the probability of 
its being broken by an intruder. Thus, a key management 
system is required. Although the key management can be 
implemented by a key management controller, Dr. E. S. Lee 
considered it preferable to implement it through a 
mechanism within each security device, where all security 
devices on a network play an equal role. In this way, the 
key management can be made independent of the number 
of hosts in a single hardware security domain and hosts can 
be added or subtracted, without the need to reprogram the 
controller. Furthermore, it was desirable that it be possible 
to connect several independent hardware security domains 
to a single physical network without the danger of data 
being transferred between domains. 

The design of such a security device is dependent upon 
the type of network for which security is implemented. 
However, the important common feature is that it is outside 
the control of host software. For Ethernet, this could be 
inclusively anywhere between the network and card 
plugged into the host bus. Because Ethernet does not 
tolerate added signal delay, without a reduction of 
performance, it is important that the delay introduced by 
adding the security device is kept to minimum. 


Til. ENCRYPTION CONSIDERATIONS 


The Data Encryption Standard (DES) [4] is one of the 
most commonly used encryption systems today. It is shown 
in diagrammatic form in Fig. 2. and is an example of a more 
general class. In this class, the encryption takes place on 
blocks of data (64 bits for DES) under the control of a key 
(56 bits for DES). For DES, the same key is used for both 
encryption and decryption, but this is not true for all 
members of this class. A further requirement of this class of 
encryption systems is that, although the output can be 
calculated from knowledge of the input and the key, it is not 
feasible to calculate the key from knowledge of the input 
and output. In the case of DES, the encryption/decryption 
algorithm is well known, but in some systems knowledge of 
the algorithm is protected. In any case, it must be assumed 
that a potential intruder has obtained knowledge of the 
algorithm and the confidentiality of the key must be relied 
upon to give protection. The main disadvantage of this class 
of encryption systems for this application is that a complete 
block of data must be input before the first bit can be 
output. Using the example of DES, changing any one of 
the 64 input bits can cause a change in any or all of the 64 
output bits. Thus, the encryption mechanism introduces a 
delay of at least 64 bits in the security device. In practice, 
this delay is longer, because of the processing time in the 
encryptor and, in a device design, it is wise to allow for a 
delay of 128 bits. In a single protected data transfer, this 
would result in there being a total delay of 256 bits (128 
bits for encryption and 128 bits for decryption) caused by 
the encryption system alone. This would significantly 


reduce the performance of Ethernet for a protected domain 
and this class of encryption systems was rejected. 


64 BIT BLOCK 64 BIT BLOCK 


PLAIN TEXT 


ENCRYPTED DATA 
DATA INPUT OUTPUT 


56 BIT SECRET KEY 


COMMUNICATION CHANNEL 


64 BIT BLOCK 64 BIT BLOCK 


ENCRYPTED DATA PLAIN TEXT 
INPUT DATA OUTPUT 
56 BIT SECRET KEY 


Figure 2: A Schematic Representation of DES 


The encryption/decryption delay for “one pad” systems 
can be significantly less than | bit time. As is illustrated in 
Fig. 3., the only delay in the signal channel is that of the 
propagation time of two exclusive OR gates. However, the 
main difficulty with a one pad encryption system for this 
application is the length of the key. The length of the 
protected key required for merely 1 hour of data 
communication over Ethernet (at e.g. 10 MHz) is 3.6x10"° 
bits. Although the storage of the above mentioned key 
length is feasible using modem data storage techniques, the 
storage of sufficient one pad key for the security devices 
proposed is not. The use of a simulated one pad key is 
feasible, providing it is constructed in a way that makes it 
effectively impossible for an intruder to predict the next bits 
based upon observations of what has gone before. Rather 
than merely cycling though memory addresses, as would be 
Tequired for a true one pad key, the simulated one pad key 
can be implemented by computing the key from the 
contents of memory. As with the DES, the algorithm for 
computing the key should be made so that it can be made 
public, providing the contents of the memory are secret, 
random and unpredictable. 


PATE a ee 


Figure 3: A Schematic Representation of One Pad 
Encryption and Decryption 
The above discussion implies the concept that the 
strength of the encryption is 2 function of both the 
complexity of computation and the size of the shared secret. 
For the purpose of this discussion, the secret will be taken 
to mean the contents of a memory, although the 
computation algorithm may also be past of the secret. In 
the case under discussion, it will be assumed that potential 
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intruders will have knowledge of the algorithm. It is 
possible to extend this idea to assess the security of 
encryption systems in general. DES is an example of a 
system that uses a small secret (56 bits), but depends upon a 
complex computation algorithm, while a true one pad 
system represents the other extreme [3]. There are many 
possible examples, which fall between these two and the 
simulated one pad key system, chosen for the security 
device, is one of these [5]. 


A. A Simulated One Pad Key Encryption System 


A simulated one pad key encryption system was chosen 
because, like a true one pad system, it imposes an 
insignificant delay on the signal and the size of the memory 
can be selected to provide adequate protection. In the final 
design, a 64 bit address was used for a simulated 2™ bit 
memory. The simulated memory consisted of only a 64 
Kbit main memory addressed from sixteen 16 bit memories, 
each with a 4 bit address field, representing another 256 bits 
of secret. This arrangement is illustrated in Fig. 4. It is only 
one of many possible arrangements for a two level 
implementation of a memory structure with a 64 bit address 
and could easily be replaced by one that stored a larger 
secret if more security was required. Note that no rigorous 
analysis has been possible for the security of this particular 
system, but results to date suggest that it is more than 
adequate [2]. The address field is generated by a 64 bit 
linear recursive shift register (LRS), which will cycle 
through 2™ — 1 addresses before repeating. If this were 
driven by a 10 MHz clock a complete cycle would take 
about 50,000 years. 


B. Key Management 


The address can be reset to any part of the above 
mentioned cycle, using a 64 bit key resynchronizing word. 
In the current implementation, this is done by key 
distribution packets, which are transmitted from one 
security device at a time and received by all the others on 
the same network segment. The contents of memory 
structure are required to construct the packet and to decode 
and verify the packet. Thus, the synchronization word is 
hidden from intruders and is not possible for an intruder to 
introduce fake key distribution packets. 

An algorithm was devised, which ensures that valid key 
distribution packets will appear on the network at regular 
predetermined intervals and permits all security devices in a 
single domain to generate these forming a queue to transmit 
the packet. When a given device transmits a packet, it goes 
to the back of the queue and does not transmit another until 
all the others have done so. When a device is first 
connected to the network, it goes to the back of the queue. 

The queue algorithm works as follows: Each security 
device contains a key distribution timer, which determines 
the interval between key distribution packets. When a 
security device is first switched on, or when it has 
transmitted a key distribution packet, the timer is set at 
maximum. The timer now counts down towards zero. If the 


security device observes a key distribution packet on the 
network before it reaches zero, it will reset the timer to one 
interval lower than it was set for the previous count. Thus, 
each time it is reset, the timer will start at a lower count. 
When the timer is set to a lower count than all those of the 
other security devices, it will reach zero before any of the 
others and trigger the transmission of a new key distribution 
packet. 


4am 
ADORESS OUTPUT 
16 BIT 
64 KBIT 
ONE OF 16 MEMORY 


(WHICH 
DEPENDTS 
UPON ALL 
2°64 INPUTS) 


SIMILAR 
MEMORIES 


Figure 4: A Simulated 2**64 bit Memory 


A side effect of the above mentioned algorithm is that if 
the key distribution packet is involved in a collision, this is 
signaled to the security device in the normal way and its 
packet will be lost. It will then go to the back of the queue 
and the next security device in line will transmit a key 
distribution packet. There are several scenarios (which will 
not be described in this paper), which would cause a 
collision between key distribution packets. If this happened, 
there is a possibility that more than one device could start at 
the back of the queue at the same time and it is further 
possible that the key distribution packets would enter a 
“knot”, in which they would continue to collide each time 
they were transmitted. It is possible to construct a scenario 
where an increasing number of security devices would get 
caught in this knot and eventually all key distribution 
packets would be caught. The result would be that the 
security devices in the affected domain would no longer 
receive resynchronization packets. Thus, a special case 
algorithm needs to be implemented, when a key distribution 
packet is involved in a collision. Instead of resetting the 
key distribution timer to the longest count, the key 
distribution timer would be set to a count that ensures that 
the packet will be ahead of the queue. For this purpose, 
each security device in a single domain must be assigned a 
count that is unique in its domain. Then, following a 
collision, the order in which they transmit their key 
distribution packets will be determined by the assignment 
of these counts. 


C. Synchronization of a Simulated One Pad Key 


The above discussion describes how long one pad keys 
can be simulated and how synchronization packets can be 
used to move from one point in the key sequence to 
another. Also, the start point, in the sequence, must be 
changed for each data packet for protection to be retained. 
If all hosts on a single network segment saw exactly the 
same number of packets and bits, synchronization would be 
automatic. However, this does not always happen, because 
of the combined effect of network propagation delays and 
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collisions between packets. The solution to this is to 
advance a fixed distance through the key sequence (longer 
than the maximum permitted packet length) for each 
packet. This imposes no serious limitation on the system, 
because of the length of the simulated key. Care must be 
taken with the count of the number of packets since the last 
resynchronization packet. Collisions can result in packet 
fragments and propagation delays can result in different 
numbers of fragments being observed at different locations. 
For this reason, fragments must not be counted and only 
packets equal to or longer than the minimum permitted 
packet length be used to determine the new key start point 
for a new packet. 

Although, the above mentioned strategy was adopted, it 
can result in the same key being used twice in two 
successive packets. If a packet is terminated early because 
of a collision, the next packet starts with the same key. This 
would permit an observer, who knew the contents of the 
aborted packet, to deduce the key for the first part of the 
following packet. This was not found to be a serious 
problem, because the part of the packet the observer would 
be able to decrypt, as a result, does not usually contain data. 
Also, during our laboratory tests, we were not able to 
capture the above mentioned situation to test it. However, it 
remains a possible threat. 


D. Performance of Practical Implementation 


Two separate systems have been fabricated and tested 
on a laboratory Ethernet. The first was a relatively simple 
system, constructed primarily of small and medium scale 
integrated circuits. The security devices were modularized 
and each module was constructed as a separate printed 
circuit board to permit rapid modification. The encryption 
system used a simulated one pad key and a 1.5 Mbit secret, 
stored in three 0.5 Mbit EPROMs. The key was organized 
to have 2” start points and used 32 bits of the key 
distribution packet to address them. 

This first system functioned well and it was possible to 
test alternative strategies for key synchronization etc.. The 
queue system was particularly convincing and it was 
observed that the order of transmitting key distribution 
messages was always maintained after the system was first 
switched on. The protected system was further tested in the 
presence of unprotected traffic on the same network. No 
communication was possible between protected and 
unprotected hosts. A protected host was able to read only 
packets addressed to it and was not able to detect anything 
about any other packet except for its length. Unprotected 
hosts operated normally between themselves and were not 
able to gather information from protected traffic. The 
results of the tests on this system permitted the design 
specification to be written for the second system. 

The second system was designed to fit a small package 
that had the appearance of a final commercial unit. Most of 


described under Section III B. The original queue system 
was used again, with no provision for the avoidance of 
knots. The simulated one pad key was constructed as a 
single 2™ bit stream with the approximately 64 Kbit secret 
size described under Section III A above. This differed 
from a production design in that the secret had to be 
installed by inserting a preprogrammed 64 Kbit EPROM 
and individual programming was required for each field 
programmable integrated circuit. For a production device, 
custom mask programmed large scale integrated circuits 
would be used and stored data would be held by a battery 
supported memory. 

This system was operated and tested on the same 
laboratory network as the first system and demonstrations 
were performed on the networks of some potential users. At 
all demonstrations installed by the design group members 
or where the installation was observed by them, the system 
performed as it was designed. The experience gained from 
these tests and demonstrations was used in the writing of 
the design specification for a production unit. 


ITV. CONCLUSION 


The work described above was terminated after the 
creation of the draft provisional specification for a 
production unit, because further funding for the project was 
not available. However, there have been two useful results, 
both of which can be considered separately. The system, 
described under Section III B. above, may merit further 
study. Also, the development and the implementation 
methodology have resulted in ideas that may be useful for 
other systems. 

The systems implemented were designed to operate on a 
single segment of a LAN, where all hosts can observe all 
packets. In the WAN environment, implemented by 
bridges and other communication devices, special 
provisions must be made. The production device 
specification takes account of this. This, along with several 
important aspects of security device and system design, will 
be discussed in future papers. 

The above mentioned work was performed in Canada at 
Quinte Network Confidentiality Equipment Inc. by a team, 
of which the authors were part. 
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Abstract-- The design characteristics of Multimode 
Interference-based NxN devices, such as NxN power splitters, 
Waveguide Routers, and Miultiplexers-Demultiplexers for 
Wavelength Division Multiplexing applications, have been 
studied. The power loss, loss imbalance and _ phase 
characteristics of the above devices on Silicon and InP have 
been examined by using the finite element-based beam 
propagation approach. An arrayed waveguide grating where 
the refractive index contrast or the width of the adjacent 
waveguides differs by a constant refractive index difference, 
An, or width difference, Aw, respectively, has been introduced 
to produce a wavelength dependent phase-shift, instead of the 
conventional arrayed waveguide-based arrangement with 
constant path length difference dispersive elements. 


Index Terms—multimode interference, wavelength division 
multiplexing, power splitters, waveguide arrays. 


I. INTRODUCTION 


Multimode Interference (MMI) coupler devices based on 
the self-imaging property [1] have been utilized as key 
components in several wavelength-division multiplexing 
(WDM) systems, such as power splitters [2], arrayed 
waveguide filters [3], MMI-PHASAR-based 
multiwavelength lasers [4] and Mach-Zender 
Interferometric switches [5]. Their ease of fabrication, 
compactness, low-loss and smaller polarization dependence 
are the main features of these MMI devices that make them 
attractive for such applications. Arrayed Waveguide 
Gratings (AWG) are widely used as dispersive elements in 
several WDM applications, such as optical switches and 
multiplexers/ demultiplexers, where a curved waveguide 
array with a constant path length difference between the 
adjacent waveguides causes a wavelength dependent phase- 
shifting. In the present work, NxN MMlI-based power 
splitters have been used as focusing/defocusing slab 
waveguides and an arrayed-waveguide grating has been 
introduced, where the adjacent waveguides have the same 
length but different dielectric properties. The refractive 
index contrast of the adjacent waveguides in the arrayed 
waveguide grating can differ by a constant value thus 
giving a wavelength dependent phase-shift for each 
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waveguide. The different refractive index contrast of each 
waveguide, in a grating where all the waveguides have the 
same dimensions, can be implemented by the use of 
thermo-optical or electro-optical switches in silica- or InP- 
based systems, respectively. An alternative approach, 
introduced in the current work to achieve different 


‘refractive index contrast in each waveguide is an arrayed 


waveguide-grating, where each waveguide has different 
width. 


Ws neds) FINITE ELEMENT-BASED 
PROPAGATION METHOD 


BEAM 


The Finite Element Method (FEM) [6] has been established 
as one of the most powerful numerical techniques, because 
of its accuracy, flexibility and versatility in the numerical 
analysis of optical waveguides, particularly in structures 
with arbitrary shapes, index profiles, nonlinearities and 
anisotropies. The Beam Propagation Method (BPM) 
approach based on the paraxial wave equation [7] is widely 
used for the study of lightwaves in integrated optical 
waveguides. The Finite Element-based BPM (FEBPM) [8] 
has been used in the present work, providing an effective 
numerical tool for the analysis of the devices examined, 
since it utilizes all the advantages of the FEM. Although a 
3D BPM simulation is possible for the present analysis, it 
is, however: computer extensive, and therefore the original 
structures were reduced to one-dimension by using the 
effective index method [9] to increase the efficiency of the 
simulations. 


IJ. NXN MMI-BASED POWER SPLITTERS 


Initially, a 16x16, 300um wide, silica-based power splitter 
with a refractive index contrast of 0.75% between the core 
and the substrate refractive indices, n, and n, respectively, 
at 1550nm operating wavelength, has been considered. A 
study of the evolution of the optical power along the device 
has been carried out when an input field profile is launched 
into the leftmost port, 16, via single-mode waveguides with 
a cross section of 6 tum x 6 pum and the device was 
simulated by using the FE-BPM algorithm. From the power 
loss characteristics obtained, the minimum power loss of 
the device was found to be 1.8dB at a length of 21.58mm. 
The output field profile of the 16 images of the input field 
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profile at the optimum length of 21.58mm, where the 
device exhibits minimum power loss, is shown in Fig.1. 


A deeply etched InP-InGaAsP (A,=1.3 pm)-InP, ridge 
waveguide structure, at an operating wavelength of A=1550 
nm, was also examined. After considering the minimum 
spacing between the input and output waveguides required 
to prevent coupling, a 24,um MMI device has been selected 
for the simulations. The length of the 16x16 device, Lynn. 
was calculated from the coupling length obtained by using 
the FEM and found to be 310.2um and 313.S5pm for the TE 
and the TM polarizations. respectively. 


Initially, the evolution of the optical power along the device 
is determined when an input field profile is launched into 
the leftmost port, via a single-mode deeply etched, lm 
wide, single mode ridge waveguides with the same material 
properties as the MMI structure. To observe clearly the 
evolution of power in the 16 channels, the normalized 
power characteristics of the device along the direction of 
propagation for the TE polarization are shown in Fig.2. It 
can be seen that within the range of 309,um and 312um, the 
normalized power characteristics are in close agreement. 
The output channels carry maximum power at about 
310.2um, which coincides with the length of the device 
calculated earlier for the TE polarization. It can also be 
observed that at 309jum most of the channels have an equal 
power output, but the total power loss was quite high at this 
position, and the resolution of the output field profile, not 
shown here, was deteriorated slightly. 

The power loss characteristics for the TE and the TM 
polarizations of the input field along the direction of 
propagation (z), within the range of the length of the device 
calculated earlier, are shown in Fig.3. The minimum loss 
for the TE and the TM mode was found to be 0.14dB and 
0.28dB at a distance of 310.7pm and 314pm respectively. 
The best resolution of the 16 images of the input field for 
the TE and the TM polarization was observed at the length 
where each polarization had minimum total loss, as 
mentioned above. 

A deeply etched InP-InGaAsP (A,=1.1 pm)-InP, ridge 
waveguide structure, with a guiding layer 0.8m high, at an 
operating wavelength of A=1550 nm, has been considered. 
A 128x128 power splitter was simulated, to demonstrate the 
capability of the present algorithm to handle a larger 
number of input/output channels. After considering the 
minimum spacing of 1.5m between the lym wide input 
and output waveguides, required to prevent evanescent 
coupling, a 320um MMI device has been selected for the 
simulations presented and is discussed below. 


The length of the structure for the TE polarization was first 
calculated from the coupling length. L,, by using the FEM 
modal solution and found to be 6.767mm. The MMI-device 
was then simulated for an input field profile launched into 
the rightmost port and the power loss and loss imbalance 
characteristics of the 128x128 power splitter, obtained by 
using the FEM-based BPM, are presented in Fig.4. From 
the power loss curves it can be seen that the device exhibits 


a minimum power loss and minimum loss imbalance of 
1.08 dB and 0.72 dB respectively, at a distance of 6.773mm 
from the origin along the direction of propagation, which is 
close to the length calculated by using the FEM. The length 
for the TM polarization of the above device has also been 
calculated by using the FEM modal solution and was found 
to be also 6.767mm, which indicates that the design is 
polarization independent. 


The field intensity at the above position is shown in Fig.5, 
where the field intensity of the first 16 output channels is 
shown as an insert. As can be seen from the above field 
profiles, although the balance of the maximum field 
intensities is not perfect, however, the power within the 
output waveguides is quite high, thus giving low loss 
imbalance and there is no overlap between the output 
images. A total loss of only 1.08 dB indicates that 80% of 
the input power has been coupled to 128 ports. The loss 
imbalance value of 0.72dB identifies that the ratio between 
the port with the highest and the lowest power in all the 128 
ports differs only by 20%. 


TV. NXN MMI-BASED MULTIPLEXERS/DEMULTIPLEXERS 


NxN MMI-based Multiplexers/Demultiplexers consist of 
two NxN MMI-based power splitters interconnected via an 
AWG-based phase-shifter. Curved Arrayed Waveguide 
Gratings with constant path length difference are widely 
used as phase-shifting devices in  multiplexing/ 
demultiplexing systems. Curved waveguides introduce 
bending loss that can be minimised by using larger bending 
radius, at the expense of larger chip sizes. In the present 
work, arrayed-waveguide gratings with straight waveguides 
have been introduced, where the phase-shift difference 
between adjacent waveguides has been implemented by 
using either a different refractive index contrast in each 
waveguide or a different width. Input wavelengths can be 
switched from one port to another, by designing the 
wavelength dependent phase-shift between adjacent 
waveguides in the AWG section, according to the required 
channel spacing of the Multiplexer/Demultiplexer system. 
The design of an AWG, with the waveguides having the 
same length. is a trade off between the length of the array, 
and the dielectric properties of each waveguides. Longer 
waveguides require larger chip size, while high refractive 
index contrast or large width can lead to undesirable multi- 
mode waveguides. 


Initially, a deeply etched InP-InGaAsP (A,=1.1 pm)}InP. 
ridge waveguide structure, with a guiding layer 0.8m high, 
at an operating wavelength of A=1550 nm. has been 
considered. For the multiplexing stage of the multiplexer/ 
demultiplexer system, a 6m wide 4x4 Power Splitter, with 
lum wide input waveguides spaced by 1.Sym, has been 
used. This structure was simulated, for an input field 
launched at the leftmost port, 4, by using the BPM approach 
and the length of the device was found to be 75.7um. The 
input mode launched propagates through the structure and 
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splits into 4 equal fields at the end of the device. The phase 
differences between the output ports were also calculated 
and found to be in good agreement with the theoretical 
values derived from the equations given by Bachman ef al. 
[10]. 


Initially, a single wavelength has been used, and the 
operation of the device as an optical switch has been 
studied. The field at any input port can be switched to the 
required output port by adjusting the required phase-shift at 
each waveguide in the phase-shifter stage. Since the same 
length for all the waveguides has been considered, electro- 
optic switches that vary the dielectric properties of each 
waveguide can be used to implement the phase-shift change 
in each waveguide. 


An AWG-based phase-shifter stage with 4 straight 
100um long waveguides has been considered. The 
refractive index differences, An, for each waveguide in the 
AWG, with respect the leftmost output waveguide 1, have 
been assumed to be: 


Anj=nj-n|=0, 0.01, 0.01, 0 (i=1,4) (1) 


where n;, n, are the refractive indices of each output 
waveguide, i, and the leftmost output waveguide, 1, 
respectively. 


The relative, phase-shifts between the waveguides in an 
AWG remain constant if the length and the refractive 
indices of all the waveguides are the same. By introducing 
refractive index differences between the waveguides in the 
AWG, proportional phase-shift differences, A®, are 
produced. For the refractive index differences applied 
above the phase-shift difference between each waveguide 
and the leftmost waveguide 1, are: 


AD=0,-0 = 0, 1/2, 0/2, 0 (i= 1,4) (2) 


where ®i and ®, are the phases of each waveguide, i, and 
the leftmost waveguide, 1, respectively. With such phase- 
shift differences in the AWG, an input signal launched 

at the leftmost port has been switched to the leftmost output 
port. Next, the refractive index differences between the 
output waveguides were modified to: 0, -0.01, -0.01, 0 and 
the corresponding phase shift differences were changed to: 
0, -1/2, -1/2, 0. With the above phase differences in the 
AWG, a field launched in turns at each of the input ports 1, 
2, 3, and 4, will be switched to output ports 4, 2, 3, 1, 
respectively. The output field profiles for the input fields 
launched independently at each of the input ports are shown 
in Fig.6. The average crosstalk of the above output fields 
was found to be less than -22dB. 


The AWG waveguides the structure with the variable 
refractive index, examined earlier, were replaced by straight 
waveguides having variable width but the same refractive 
index and length. The total length of the array, Lanay, Was 
kept at 100um and the widths of the middle waveguides 2 


and 3 of the AWG were changed from lum to !.2um. The 
phase-shift difference of these waveguides with respect to 
the leftmost output waveguide, 1, was found to be 1/2, and 
therefore by launching an input field at the leftmost input 
port, 4, the output will be switched to output port, 4. The 
crosstalk in the output field was found to be less than 22dB. 


V. CONCLUSION 


A numerical finite element-based 1D beam propagation 
approach has been used to analyse the power, attenuation 
and phase properties of NxN multimode interference-based 
devices for WDM applications. The phase relationships 
between the different channels for the above devices have 
been calculated were in good agreement with those 
obtained by using theoretical analytical equations. Power, 
Loss Imbalance, and the polarization dependence properties 
of several InP-based devices have also been examined The 
characteristics of the NxN optical switch and multiplexer/ 
demultiplexer structures formed by introducing straight 
optical waveguides in an arrayed waveguide grating, 
instead of the conventional curved AWG, have also been 
examined. The straight arrayed waveguide sections that act 
as wavelength dependent phase-shifters reduce the chip size 
of the above devices and they incorporate optical 
waveguides with different refractive index contrast or 
different width. It has been shown that optical waveguides 
with constant width difference can be used to replace the 
curved waveguides with constant path length difference in 
multiplexing/ demultiplexing systems. 
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SD+: Improving TCP Performance over ATM 
UBR Service Using Modified Selective Drop 
Buffer Management Scheme 


Aly E. El-Abd, Member IEEE and Mohamed A. Mostafa 


Abstract 

A new packet discard scheme called Selective Drop with 
forgiveness mechanism that can be plugged into the 
existing Selective Drop buffer management scheme is 
developed. The new scheme that we call SD+ remarkably 
enhanced both throughput and fairness of TCP running 
over ATM-UBR service. Our scheme does not require per 
VC accounting but per VC-state where only one bit is 
needed. 

The focus here will be on demonstrating and analyzing the 
effect of the forgiveness mechanism when plugged into the 
SD scheme in low latency networks. The SD scheme is 
selected because it is evolved as an improvement over both 
PPD and EPD policies that can be practically applied to 


| existing ATM switches. This paper explains the concepts 


behind the new scheme and demonstrates its effect on TCP 
performance using simulation. 


A. INTRODUCTION 
UBR service is characterized by its simplicity, low 


~ overhead and cost effectiveness. It is also characterized by 


the lack of built in congestion control mechanisms [5]. As 
85% of computer networks are running TCP/IP [13], it is 
expected that many TCP implementations will use the 


__UBR service category. 

_ Several studies have analyzed the performance of TCP over 
_ the ATM-UBR service concluding that TCP applications 

_ running over ATM-UBR switches with limited buffer size 


suffer from low throughput and are deprived from fairly 
sharing the available bandwidth [1, 7, 9,13]. Accordingly 
several end system side and network side enhancements 


| had been proposed to improve this situation. 


Network side enhancements focused on improving the 
packet discard polices of ATM-UBR switches, aiming to 
avoid -as much as possible- hurting TCP throughput under 
network congestion conditions. 

Network side drop policies can be roughly divided into two 
categories: simple discard policies and sophisticated buffer 
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management policies. Simple discard policies like partial 
Packet Drop (PPD) or Early Packet Discard (EPD) lead to 
enhanced throughput but low fairness. Sophisticated 
mechanisms like Selective Drop (SD), Random Early 
Detection (RED) and Fair Buffer Allocation (FBA) 
improve fairness and throughput [1,3] but require per VC 
accounting with considerable overhead and computations 
that increase algorithm complexity, in addition to being 
parameter sensitive. 

In this paper, we propose the forgiveness mechanism that 
can be plugged into the existing buffer management 
schemes to remarkably enhance both throughput and 
fairness. Our mechanism does not require per VC 
accounting but per VC-state where only one bit is needed 
per VC. In a previous paper [15] we demonstrated the 
positive effect of our proposed forgiveness mechanism over 
EPD. Our focus her is demonstrating and analyzing the 
effect of the forgiveness mechanism when plugged into the 
Selective Drop scheme in low latency networks to enhance 
further SD performance. 


B. THE SELECTIVE DISCARD SCHEME (SD) 


SD evolved as an improvement over EPD policy. EPD 
assigns a threshold smaller than buffer size, below which 
no cell discard takes place. If the queue length exceeds this 
threshold, EPD drops all arriving cells that belong to new 
packets while cells belonging to partially received packets 
are still accepted [9]. EPD does not guarantee fair sharing 
of available bandwidth between contending TCP sources 
[1,13]. 

SD proposed by Goyal et Jain [1] ensures fair resources 
sharing by monitoring the share of each VC of the switch 
buffer. They deterministically drop cells belonging to 
greedy VCs in case buffer occupancy exceeds a pre- 
determined threshold R (similar to EPD threshold). 

SD keeps track of the activity of each VC by counting the 
number of cells from each VC in the buffer. A fair 
allocation is calculated as the (current buffer occupancy) 
divided by the (number of active VC’s), where an active 
VC is defined as the VC with at least one cell in the buffer. 
Let the buffer occupancy be denoted by X, and the number 
of active VCs be denoted by Na Then: 


SA. Mostafa is an M Eng. Student. Fair allocation = — 
a 
ee ee 
10" Mediterranean Electrotechnical Conference, MEleCon 2000, Vol. I a 


0-7803-6290-X/00/$10.00 ©2000 IEEE 


The ratio of the number of cells of a VC in the buffer to the 
fair allocation gives a measure of how much the VC is 
overloading the buffer i.e., by what ratio it exceeds the fair 
allocation. Let Yi be the number of cells from VCi in the 
buffer, then the Load Ratio of VCi is defined as: 

Load Ratio of VCi = (Number of Cells from VCi ) / (Fair 
allocation) 

If buffer occupancy exceeds the static threshold R, and the 
load ratio of some VC is greater than a parameter Z, then 
new packets from that VC are dropped, in preference to 
packets of another VC with load ratio less than Z. 

Figure (1) illustrates the buffer management policy of SD. 


4 
Packets may be discarded No packet discard 


Figure (1) SD discard policy 


G SD+: THE FORGIVENESS MECHANISM 


The forgiveness mechanism exploits a scheduling policy 
commonly used in operating systems. The scheduling 
algorithm tends to lower the priority of resources hungry 
processes in favor of lightly weighted processes. After a 
certain number of successive priority reductions, the 
scheduling algorithm forgets the past history of the heavy 
weighted process and treats it as a newly arriving job 
giving it the highest priority and a new chance to live [14]. 
We adapted the above concept to the ATM environment by 
assigning a single bit that we call the congestion bit to each 
VC. Whenever the buffer occupancy exceeds the static 
threshold R, newly arriving cells are continuously checked 
to see if they belong to greedy VCs (i.e. VCs that their 
buffer occupancy ratio exceeds the value of the parameter 
Z). If the newly arriving cell belongs to a greedy VC, it 
will be dropped and the congestion bit of its VC is set to 
one. 

In the next congestion episode, newly arriving packets 
belonging to VCs whose their congestion bit is set are not 
dropped, even if their share of buffer space exceeds the 
value of the parameter Z. This time greedy VCs with clean 
congestion bit are dropped while greedy VCs with a set 
congestion bit are spared. The accepted VCs will have their 
congestion bit reset while the dropped VC's will have their 
congestion bit set. Thus forgiveness mechanism has no role 
during the first congestion episode but it takes over during 
next congestion. Hence, forgiveness mechanism accepts 
new cells from sources with lower value of TCP CWND 
even when congestion condition is met. 

Floyd et Jacobson [10] state that window flow control 
protocols have a periodic cycle equal to the connection 
round trip time, and that this periodicity can resonate (i.e. 
have a strong non-linear interaction) with deterministic 
control algorithms in network gateways. They also show 
that drop tail gateways in a TCP/IP network with strongly 
periodic traffic can result in systematic discrimination 
against some connections. Goyal et all [1] demonstrate the 


effect of such periodicity on TCP throughput and fairness 
in ATM networks with tail drop and early packet discard 
drop policies. 

The forgiveness mechanism effectively helps reducing TCP 
synchronization effects described in [1] and [10] and helps 
promoting randomness element in the network, hence 
breaks down phase effects and diminishes resonance 
responsible for efficiency degradation. This _ is 
accomplished by selectively and deterministically dropping 
connections with larger CWND while keeping other 
connections untouched in one congestion episode and 
reversing this action in the next one, the thing that will 
also enable TCP sources to exploit a fair share of the 
available network capacity in turn. 

It is well known that successive congestion badly hurt TCP 
efficiency. As the size of SSTHRESH will dramatically 
decrease leading to a situation in which the slow start 
phase, where CWND size increases exponentially, will be 
too short increasing the time taken by the TCP source to 
reach steady state after congestion breakdown. The 
forgiveness mechanism remarkably increases throughput of 
TCP sources as it prevents successive reduction of 
SSTHRESH. 

The forgiveness mechanism requires minimal 
housekeeping overhead (one bit per VC) Also the logical 
condition it enforces can be easily applied in hardware 
which makes it compatible with ultra fast switching 
requirements of ATM networks and easily applicable in 
existing ATM switches. 

To evaluate the effect of the forgiveness mechanism, we 
use simulation, an approach commonly used in many other 
papers that dealt with the subject of traffic management of 
TCP/IP over ATM networks [1,2,3,7,9]. We use the 
ATM/HFC network simulator version 4.1, released 
December 1998, by the American National Institute for 
Standards and Technology -NIST. The simulator source 
code has been modified to add the forgiveness mechanism 
condition to the SD drop policy. 


D. SIMULATION MODEL, PARAMETERS AND 
PERFORMANCE METRICS 


Simulation model: 

All simulations presented in this paper use the N source 
configuration shown in figure (2). This model has been 
widely used in other papers discussing performance issues 
of TCP over ATM [1,2,3,7]. The model consists of N 
identical TCP sources that share a single bottleneck. The 
switches used in this model implement UBR service with 
either SD or SD+ (SD with the forgiveness mechanism 


Figure (2) The N-Source configuration 
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Simulation parameters: 

Selection of parameters is in accordance with the 
guidelines in [8]. The following simulation parameters are 
used: 


1) The configuration consists of N identical TCP sources 

as shown in figure (2). 

2) All sources are ftp sources with very large file sizes 
(30 M B); ftp is a very common traffic pattern, large 
file sizes ensure continuos data flow throughout the 
simulation period. 

3) ATM applications are assumed to be running over 
SONET STS-3c links with bandwidth equal to 155.52 
Mbps. 

4) The following link lengths have been tested: 0.8 Km, 1 
Km, 2 Km and 3 Km per link. 

5) Switches are assumed to implement SD/SD+ discard 
policies; R parameter is set to buffer size* 0.90 cells 
and Z parameter is set to 0.8;these values are selected 
because they ensure optimum results for efficiency and 
fairness as described in [1]. 

6) Switch buffer sizes of 1000 and 3000 cells have been 
tested respectively. 

7) Peak cell rate is 146.9 Mbps (after factoring in 
SONET overhead). 

8) Traffic is unidirectional, only sources can send data 
and destinations send acknowledgments only. 

9) TCP maximum segment size is set-to 512 bytes; this is 
the standard TCP segment size. 

10) TCP retransmission timer granularity is set to 100 ms. 

11) TCP RCVWND is set to 64 KB. 

12) All TCP sources start sending data at the same time 
and keep on sending data throughout the simulation, 
this pattern increases the probability of TCP 
synchronization. 

13) Simulated duration is 10 seconds. 

Performance metrics: 

Two performance metrics are used: efficiency and fairness, 

the same set of metrics have been used in several studies 

for performance measurements of TCP over UBR [1,3,7]. 

They also conform to the recommendations of the test- 

working group of the ATM FORUM [6]. 

Efficiency can be defined as the ratio of the goodput 

achieved by all TCP connections to the maximum 

achievable throughput of TCP over ATM-UBR, 
interpreting the above sentence to a mathematical formula, 
we get: E= pitied 

a & 


Where JV is the total number of TCP sources, X; is the 
goodput of TCP source number i and C is the maximum 
achievable throughput of TCP over a certain ATM link. 

The goodput of a TCP source is defined as the total number 
of successfully transmitted bytes (excluding 
retransmissions) divided by the simulation time. The 
maximum possible TCP throughput over UBR can be 


calculated as follows: For a 512 bytes TCP segment size, 
ATM layer receives the following payload: 


8 byte |20 byte IP| 20 byte TCP 
LLC header 
header 


512 byte |8 byte AALS 


This payload is padded to form 12 ATM cells, thus 512 
data bytes at the TCP layer results into 636 bytes at the 
ATM layer. So, the maximum possible throughput is 
512/636 =80.5 %=125.2 Mbps on a 155.52 Mbps link. For 
ATM over SONET, this number is further reduced to 120.5 
Mbps [1,11]. Fairness is measured by the fairness index F 
defined by: 


Fairness index (F) = 


Where e; is the fair share of source i of the available 
bandwidth for the N source configuration e; = C/N. A 
fairness value of 0.9 may or may not be accepted according 
to the nature of application and the number of sources 
involved. A fairness index of 0.99 is considered to be near 
perfect [1]. 


E. SIMULATION RESULTS 


We simulated 5 TCP sources with finite buffer switches. 
The simulations were performed using two values of buffer 
sizes 1000 and 3000 cells respectively, these values were 
chosen because they are closer to existing LAN switches 
and have been used in other studies [1]. For each buffer 
size, link length has been set to 0.8 Km, 1 Km, 2 Km and 3 
Km respectively to investigate the effect of propagation 
delay on overall Performance. 

The results are illustrated in the following tables: 


Table (2) SD+ fairness enhancement 
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Fig (3) Performance of SD+ Vs SD 
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Figure (4) SD+ performance (large number of sources) 


In case of large number of sources; we simulated 10 TCP 
sources with finite buffer sizes of 2000 and 3000 cells for 
link lengths of 0.8 Km, 1 Km and 1.5 Km, were 
considered. Simulation results are illustrated below. 
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Table (4) SD+ for large number of sources (Fairness) 


From tables (1), (2), (3) and (4) the following observations 
can be made: 

Buffer size is proportionally related to performance: It 
is clear that switches with larger buffer size lead to obvious 
increase in TCP efficiency and configuration fairness; 
under same network conditions. The switch with 3000 cells 
buffer capacity leads to 10% ~ 40% rise in the efficiency 
and 1% ~ 7% rise in fairness index than that of-2000 cells 
buffer size. 

SD+ remarkably enhances performance of TCP over 
ATM-UBR: Remarkable increase in efficiency is observed 
when applying SD+. This improvement is due to SD+ 
drop policy that breaks synchronization between TCP 


sources. Hence decreasing the probability of switch's buffe 
overflow which negatively affect TCP throughput as - 
forces TCP sources to enter the slow start phase decreasin 
CWND to 1 as described in section 2. SD+ mo: 
remarkable efficiency improvement is achieved in the cas 
of 2000 cell switches network where about 48% increase i 
efficiency is observed while the highest efficiency valu 
(0.94) is achieved in the network with 3000 cell buff 
switches. 

SD+ enhances fairness: SD+ increases efficiency but doe 
not sacrifice fairness, fairness value achieved with SD+ ; 
higher than that obtained when applying SD in most case: 
Even when SD achieves a suitable fairness index (0.99) i 
networks with larger switch buffer sizes, it achieves th 
same value or even increases it to the ideal value (1). Th: 
is due to SD+ drop policy which fairly allocates th 
bandwidth between the contending sources by granting th 
available network resources to TCP sources with lows 
CWND. 

SD+ has a steady state performance: It is clear fro! 
simulation results that SD+ has a steady state performanc 
in all configurations simulated where the values « 
achieved efficiency and fairness are almost constant, whil 
the performance of SD was fluctuating between goo 
performance and unexpected performance breakdown. 


F. CONCLUSION 


SD+ enhances efficiency and fairness over SD. Th 
positive effect is obvious in switches with lower buffer size 
(1000 & 2000 cell switches) where remarkabl 
improvement is observed. In case of switches with 300 
cells buffer size, the achieved values of efficiency an 
fairness are almost equal to the values achieved by SD. 
SD+ performance is stable in different mnetwor 
configurations while the performance of SD is fluctuatin 
when tested under the same configurations. 

The vision of future work includes Further testing for th 
forgiveness mechanism for various management policie: 
TCP segment sizes and link lengths 
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Parallel Genetic Algorithm for Optimizing Run-time 
Reconfigurable Circuits 


Hubert Frohlich, Andrej KoSsir, Baldomir Zajc 


Abstract—In this paper a methodology for finding the 
maximal common subgraph of two directed graphs 
with parallel genetic algorithm is discussed. The 
method is directly applicable to the optimization of 
configurations of FPGA (Field Programmable Gate 
Array) circuits in Run-Time Reconfigurable systems. 


The problem of finding the maximal common sub- 
graph is known to be NP-complete. The advantage of 
our approach is that we find optimal or near-optimal 
solutions in polynomial time using a genetic algo- 
rithm. Since the cost function of the optimization 
task is multimodal, an implementation of parallel ge- 
netic algorithm assures significant improvments of the 
results, 


Keywords— reconfigurable circuits, parallel genetic al- 
gorithm 


I. INTRODUCTION 


The technology of FPGAs (Field Programmable 
Gate Arrays) was introduced in the mid 1980s. 
It’s success was mainly due to it’s capability for 
implementing significantly more logic then other 
programmable devices and could easily implement 
multi-level logic while most other programmable de- 
vices were optimized for two-level logic. 

Recent families of FPGAs offer a new feature called 
run-time reconfigurability (RTR) which means that 
they can be reconfigured partially and during run- 
time. This powerful feature enables many new ap- 
plications [4] which are conceptually different from 
applications implemented in classical FPGAs. 


An FPGA device consist of a mesh of configurable 
blocks called Configurable Logic Blocks (CLBs) and 
of a powerful network of connections which is also 
fully programmable [3]. 


Since the cost function describing reconfiguration op- 
timization problem is multimodal, an implementa- 
tion of parallel genetic algorithm is required. Simple 
genetic algorithm usually converges to local maxima 
which is, according to experimental results, signif- 
icantly different from global maxima and approxi- 
mation to the solution is not good enough for real 
applications. This problem is due to high probabil- 
ity of destructive crossover and can be solved by an 
appropriate control of flow of chromosomes which is 
known as a parallel genetic algorithm. 


University of Ljubljana, Faculty of Electrical Engineer- 
ing, TrzaSka 25, 1000 Ljubljana hubert.froehlich@adacta.si, 
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II. BASIC DEFINITIONS 


We introduce a definition of a directed graph and 
refer to it simply as graph. A graph G is a pair 
(V(G), E(G)), where V(G) is a non-empty set of el- 
ements called vertices, and E(G) is a finite set of or- 
dered pairs of distinct elements of V(G) called edges. 
An embedding y induces a map y : E(G) 7 
X x X, given by (u,v) = (p(u),y(v)). A pair 
(y(V(G)), p(E(G))) is again a graph denoted by 
y(G) and is called, embedded graph. 

Let X be a set, G; and G2 graphs with embeddings 
yi : G, > X and ye : G2 — X. An embedded 
common subgraph is a subgraph H of graph G with 
embedding y : H + X such that y(H) is a common 
subgraph of graphs y;(G) and y(G2). A maximal 
embedded common subgraph is an embedded com- 
mon subgraph which is not a proper subgraph of any 
embedded common subgraph. 

In later sections an embedded common subgraph is 
used in description of common parts of FPGA cir- 
cuits, where a set X is chosen to be a discrete struc- 
ture describing FPGA circuit. 


III. PARALLEL GENETIC ALGORITHM 


Genetic algorithm (GA) is a stochastic optimiza- 
tion procedure used for solving discrete optimization 
problems. Our application of GA is to solve the prob- 
lem of finding a maximal common subgraph (MCS) 
of given graphs. This optimization problem has an 
optimization space that is to large for any determin- 
istic optimization procedure. 

As already mentioned in the introduction, when mul- 
timodal cost function is optimized a premature con- 
vergence to local maxima is encountered. This is the 
case here with our reconfiguration problem. A so- 
lution to this problem is an implementation of the 
parallel genetic algorithm. 

When two completely different chromosomes are 
crossed a probability of resulting chromosomes be 
destroyed is relatively high. It means that in this 
case the given approximation to the solution is very 
likely to be destroyed. To prevent such destructive 
crossings a population at each step of the algorithm 
is partitioned into subpopulations and the rate of 
exchange between subpopulations is controlled. The 
implementation of this feature requires a modifica- 
tion of the procedure chose() for random choosing 
pairs of chromosomes to be crossed. The probability 
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distribution P[c2 = chose(c1)|c=c,] is of key impor- 
tance. 


IV. OPTIMIZATION 


In this section basic notation of discrete optimization 
problem is presented, see [5]. 
Let ® be a set and let 


P:d5R 


be a map to extended real line IR = IR U {—00, oo}. 
A set © is called optimization space and function P 
is called cost function. 

Define a set 


Min(®, P) = {x € ®; Vy Ee ®: Ply) > P(z)}. 
and a minimal value of the cost function 


infzce P(s), & # 0; 
‘ G = 0. 


min(®, P) = { 


The following equality is obvious 
min(®, P) = {x € ®; P(x) = min(®, P)}. 
We also define 


Maz(®,P) = Min(®,—P) 
maz(®,P) = —min(®,—P). 


A pair 
(&,.P) 


is called an optimization task. To solve the optimiza- 
tion task (®, P) it means to complete at least one of 
the following tasks: 

1. find Maz(®, P); 

2. find at least one x € Maz(®, P); 

3. find maz(®, P); 

4. find such z € © that the distance between x and 
maz(®, P) is sufficiently small. 

If a set ® is a discrete set, an optimization task © is 
called discrete optimization task. 

When the size of a set ® is huge it may happen 
that the computation complexity of the discrete opti- 
mization procedure exceeds any reasonable value. A 
stochastic optimization procedure should be applied 
then. We have introduced the genetic algorithm and 
it turned out that it is suitable for finding maximal 
common subgraph of a given pair of graphs. This 
optimization task will be presented in later sections. 
Genetic algorithm optimization procedure solves an 
optimization task (®, P) as formulated in the state- 
ment 4.) in the above list of possible forms of solu- 
tions. 


V. PROBLEM STATEMENT 


Run-time reconfigurability improves functional den- 
sity (number of functions per number of gates) but 
requires time for partial reconfiguration. Since the 
speed is one of important factors we have to mini- 
mize the time needed to reconfigure the device from 
one configuration to another. 

The most straightforward way to minimize the re- 
configuration time is to find the largest common part 
of all needed configurations and reconfigure only the 
parts of the device which do not implement the com- 
mon functions. 

In order to minimize reconfiguration time we need to 
find a maximal common subgraph of all graphs that 
represent all configurations needed for the applica- 
tion by following the rules from VI. 


VI. REPRESENTING FPGA CONFIGURATIONS 
WITH GRAPHS 


Every FPGA configuration is a digital circuit. Rep- 
resenting digital circuits on a gate level with a di- 
rected graph is straightforward: 

e every output of a gate is represented by a vertex, 
e every connection is represented by an edge. And 
edge start at the vertex where the connection is an 
output of a gate and ends at the vertex where it 
represents an input of a gate. 

e every input to the circuit is represented by a node 
with fanin 0 and fanout > 0 

e every output to the circuit is represented by a ver- 
tex with fanin > 0 and fanout 0. 

Let S be aset of basic logic elements from digital cir- 
cuits S = {AND, NAND, OR, NOR, NOT, DFF, 
TRI, IN, OUT}. 

Every vertex in the graph represents one element of S 
and every edge in the graph represents a connection 
between two gates or gate and input or output. 

In order to find a common subgraph of given graphs 
G, and G2 an optimization space ® is chosen to be 
a set of all common subgraphs of G; and Gz. We 
denote it by ® = CSub(G, G2). 

CSub(Gi,G2) is obviously a discrete optimization 
space. Cost function P is defined to be a measure of 
the size of a given subgraph H € CSub(G, G2). One 
of the most critical factors in discrete optimization 
task is the size of the space |®]. 

We now estimate the size of the discrete space 
CSub(G,,G2). The number of nonidentical (p,q) 


graphs is 
(cr 
q; 


where p is the number of vertices and q is the number 
of edges in the graph. The number of all nonidentical 
graphs with p vertices is equal to 


9p(p-1) 
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If we assume G, = Go, then a set of common sub- 
graphs C'Sub(G,, G1) equals to a set of nonidentical 
subgraphs of a graph G. If the graph G2 equals to 
full graph Ky, then we have 


\CSub(Gi,G1)| = 2?@-0), 


This is the upper limit of the size of our optimization 
space. 

Since a graph with p vertices has at most p(p — 1) 
edges an inequality qg < p(p-— 1) is true. Let H be 
the maximal common subgraphs of graphs G, and 
G2 with p, = |V(H)| and gq, = |E(H)|. In this case 
the size of the space equals to 


Ps(Ps—1) if 
jesu(@r.Gyi= yy (??,”) 


k=0 


and the upper limit as the worst case estimation is 
|CSub(Gi,G1)| < 27-9, 


As we have demonstrated the worst case estimation 
of the size of the space shows that its size is very 
large. This is the main reason to apply a Genetic 
algorithm as an optimization procedure. 


VII. OPTIMIZATION PROBLEM AND GA 


In this section we give a description of the applica- 
tion of the Genetic algorithm for solving the above 
presented optimization task. 

Let G; and G2 be two graphs of which we are search- 
ing the MCS. 

We define our coding map as M : ® — C, where 
C is a coding space C = {0,1,..., M}% where N is 
the number of vertices in the smaller graph and M 
is the number of vertices in the larger of graphs Gj 
and Go. 


Let V(Gi) = {1,2,3,4,5,6,7,8}, V(Go) = {1,2, 
35/4,,056,.7 }, E(Gi) ca {(1, 5), (2, 5), (3, 6), (4, 6), 
(5,7), (6, 7), (7,8)} and E(G2) = {(1, 5), (2,5), 


(3,5), (4,6), (5,6), (6, 7)}- 

Now we define a Node Correspondence List (NCL). 
NCL is a list of ordered pairs of elements from IN 
which correspond to vertices in G; and G2. It has 
the following properties: 

1. For every pair of graphs there exists at least one 
NCL. 

2. For every pair of graphs there is finite number of 
NCLs. 

3. Every NCL together with G, and G2 defines a 
subgraph 

4. Every node from G, in a NCL corresponds to at 
most one node from G2. 

An example of NCL for the graph mentioned above 
would be {(li, 12), (31, 45), (21, 22), (51, 52)}. 


For every pair from E,4 and Eg we say that it is 
contained in NCL iff the first index of a pair from 
E(G,) forms a pair in NCL with the first index of 
pair from Eg and the second index of a pair from 
E(G;) forms a pair in NCL with the second index of 
a pair from E(G2). For example pair (1,5), (1,5)2 
is contained in NCL shown above but (3,6)1, (3, 5)2 
is not. 

To find which common subgraph is represented by 
this NCL and G; and G2 we apply the following 
rules: 

1. If the pair from E(G,) and E(G2) is contained 
in NCL and none of the vertices from smaller of the 
two graphs is already in the graph add the starting 
vertex, the ending vertex and the edge between them. 
The vertices have the same indexes as the pair from 
the smaller graph has. 

2. If the starting vertex is already added to the sub- 
graph, add the edge and the ending vertex. Make 
the index of the ending vertex the same as the end- 
ing index of the pair from the smaller graph. 

3. If the ending vertex is already added to the sub- 
graph, add the edge and the starting vertex. Make 
the index of the starting vertex the same as the start- 
ing index of pair from the smaller graph. 

The Cost function depends on the target FPGA and 
is based on the number of allowed graph operations 
needed to come from one graph to another. 


Genetic operator Selection 


For selection we implemented a classical roulette 
wheel based procedure as can be found in [2]. 


Genetic operator Mutation 


Operator mutation is fairly straightforward. The 
procedure is shown below: 

1. Chose a random position k within NCL. 

2. Generate a set Ey = {1...M}\ (NCL \ {0}) 
where M is the number of edges in the larger graph. 
3. Select at random one value from Ey and a ran- 
dom value k where k € {1...N} where N is the 
number of edges in smaller graph. 

4. replace k — th element from NCL with the value 
chosen from Ey. 

This procedure is necessary because NCL cannot 
have two values, other than zero, to be equal. This 
is a consequence of property 4 of the NCL. 

The effect of mutation is that it adds or removes an 
edge from a common subgraph or sometimes it moves 
it within the subgraph. 


Genetic operator Crossover 


This operator is slightly more complicated than mu- 
tation. Again we have to perform a crossover in such 
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a way that no illegal chromosomes will be generated. 
The procedure for crossover is as follows: 


1. Generate a set Ec of possible cutpoints (positions 
in NCL that will not produce an illegal chromosome). 
Possible cutpoints are from 1 to N-1, where N is the 
number of vertices in the smaller graph. 

2. Select at random one value from Ec. 

3. Exchange the parts of chromosomes to the right 
from the cutpoint. 


Such a crossover proved to be the best from the ones 
we tried because it was the only one that took the po- 
sitional value of genes in chromosomes into account. 
It also gave us the best results. The other crossovers 
we tried were used in the traveling salesman problem 
but they did not perform well because they were to 
destructive when implemented with our coding table. 


VIII. RESULTS 


Finding the MCS is not enough for the optimiza- 
tion task to be solved. From MCS we have to cal- 
culate the number and type of operations needed to 
reconfigure the FPGA. All these parameters have to 
be taken into account when evaluating the fitness of 
the chromosomes. To apply our algorithm to find an 
MCS of n graphs we need only to change the function 
for evaluation of the fitness of the chromosomes. This 
is easily done by applying the rules for constructing 
the MCS from NCL to all n graphs at the same time. 
Usually the overall reconfiguration time (that is the 
time needed to sequentially configure FPGA in ey- 
ery needed configuration) is minimized except in the 
following cases: 


e When some reconfigurations are not time critical. 
In this case we can simply exclude them from the 
fitness function. 

e When two or more configurations can fit into the 
target FPGA at the same time. We can treat those 
configurations as one and represent them as one non- 
connected graph. No additional requirements are 
needed for searching the MCS of the non-connected 
graphs using our algorithm. 


We define groups of circuits A={G,; = 8208.1 
(ISCAS’89) G2 = s27 (ISCAS’89)}, B={G, = s208.1 
(ISCAS’89) G2 = s288 (ISCAS’89)}, C={G, = mesh 
5x5, G2 = mesh 4 x 4}, D={G, = full (20 nodes), 
G2 = full (50 nodes) }, E={G, = full (20 nodes), G2 
= full (20 nodes)}. The table shows example results 
for this groups of circuits. 
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where t is execution time, P, crossover probability, 
Py mutation probability and |P| the size of genera- 
tion. 

We can see that for more irregular graphs like those 
representing ISCAS’89 circuits the mutation prob- 
ability had to be kept low and the size of genera- 
tion did not improve the performance significantly 
while with regular graphs with a lot of connections 
the generation size is bigger and mutation probabil- 
ity significantly higher. Experiments also show that 
with richly connected sircuits (big average fanout 
and fanin) preformance of PGA tends to be better 
than with poorly connected ones. 


es] 


IX. CONCLUSION 


We presented a method of optimization of configura- 
tions of Run-Time Reconfigurable FPGAs based on 
finding the maximal common subgraph of the graphs 
representing the FPGA reconfigurations. We used 
parallel genetic algorithms for optimization due to 
the very large optimization space. 

The results show that configuration time savings 
when one FPGA is used for more than one function 
are considerable. 

The method we presented was for a single FPGA de- 
vice but can easily be extended to a larger systems. 
Also our algorithm for searching for MCS can be gen- 
eralized to search for MCS of any number of graphs. 
The only change needed in the algorithm would be 
the cost function. These improvements are also the 
guidelines for our future work. 
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Time Domain Performance Evaluation of 
Adaptive Hybrid Cache Coherence Protocols 


Danko Ivosevic, Sinisa Srbljic, and Vlado Sruk 


Abstract —Adaptive hybrid cache coherence protocols use 
both the write-invalidate mechanism and the write-update 
mechanism to maintain coherence among copies of data 
objects. Each of these protocols implements a decision function 
that chooses the appropriate mechanism in order to improve 
their performance. In most existing solutions, decision 
functions are based on communication traffic. Moreover, the 
authors of the adaptive protocols use communication traffic as 
a performance measure in their papers. 


In contrast, in this paper we present the results of a 
performance evaluation of adaptive hybrid cache coherence 
protocols in both the traffic domain and in the time domain. 
We compare three adaptive protocols with pure write- 
invalidate and pure write-update protocols. Let ry, , ryy, and 
r, be the average communication traffic per access for the 
write-invalidate protocol, the write-update pretecol, and the 
adaptive protocol, respectively. The adaptive protocol 
minimizes the traffic if ryemin(ry), ryy). Similarly, the adaptive 
protocol minimizes the access latency if La2min(tyy, ty), where 
twp Gy, and L, are the access latencies for the write-invalidate 
protocol, the write-update protocol, and adaptive protocol, 
respectively. For some of the workload parameters the 
adaptive protocols minimize both traffic and access latency. 
However, we also present and analyze the workload 
parameters for which adaptive protocols minimize 
communication traffic, but fail to minimize the access latency. 


Index Terms — Adaptive hybrid cache coherence protocol, 
decision function, time domain performance evaluation. 


I. INTRODUCTION 


In recent years, most computer systems, from tightly- 
coupled multiprocessor systems to loosely-coupled 
distributed systems, based on either local networks or 
global networks (Internet), use data replication and caching 
to improve their performance. Two factors that affect the 
performance of a system with replicated data are the data 
access behavior of the application and the choice of the 
cache coherence protocol. Cache coherence protocols 
maintain coherence among copies of the same data object. 
Application developers and users either choose the cache 
coherence protocol appropriate for the given data access 
behavior, or tune the application to run efficiently with the 
given cache coherence protocol, 


D. Ivonevic, 8. Srbljic, and V. Sruk are with the School of Electrical 
Engineering and Computing, University of Zagreb, Zagreb, Croatia, 
E-mail, divose@yahoo.com, (sinisa, sruk}@zemris. fer, hr 


Most cache coherence protocols use one of two basic 
coherence mechanisms: the write-invalidate mechanism or 
the write-update mechanism. Adaptive hybrid cache 
coherence protocols use both mechanisms. Each of these 
protocols implements a decision function that chooses the 
appropriate mechanism in order to improve the 
performance of the adaptive hybrid system. We evaluate 
three adaptive cache coherence protocols, called hereafter 
RWB adaptive protocol [1], EDWP adaptive protocol [2], 
and APCUM adaptive protocol [3] (Adaptive Protocol 
using CUMulative cost). The performance of the adaptive 
protocols is compared with the performance of the pure 
write-invalidate protocol and the pure write-update 
protocol. We simulate the cache coherence protocols for a 
network of workstations [4]. Since the decision functions of 
most adaptive protocols choose the mechanism that incurs 
the lowest communication traffic, the authors of the 
adaptive protocols choose traffic as the performance 
measure. 


However, for most applications the final performance 
goal is not to minimize the communication traffic, but to 
reduce the access latency. Therefore, we decided to 
evaluate the adaptive protocols in both the traffic domain 
and the time domain. We chose the average traffic per 
access as the performance measure in the traffic domain 
and the average latency per access as the performance 
measure in the time domain. 


The simulation model and basic coherence 
mechanisms are presented in Section 2. Section 3 briefly 
describes the adaptive cache coherence protocols. We 
discuss the simulation results in Section 4. Final comments 
on the performance evaluation are given in Section 5S. 


Il. THE SIMULATION MODEL AND BASIC CACHE 
COHERENCE MECHANISMS 


We simulate a distributed system, where seventeen 
workstations share one data object [5]. One of the seventeen 
workstations, denoted as the sequencer [4], performs global 
sequencing in order to ensure coherence among the copies 
of a given data object. In addition, the sequencer is the 
home location for the given data object. All of the other 
sixteen workstations are called clients. If a client does not 
have a copy of the data object that it needs, it sends a 
request to the sequencer. Requests that are cache hits (both 
reads and writes) are executed /ocal/y, while other requests 
that are cache misses are called remote. A local access does 
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not incur communication traffic and, in addition, we assume 
that the latency for a local access is equal to zero. 


The seventeen workstations have a bandwidth of 10 
Mb/s (megabits per second) available for communication. 
The workstations communicate via packets of three 
different sizes: the size of a small packet is 45 B (bytes), the 
size of a medium packet is 90 B and the size of a /arge 
packet is 720 B. Small, medium, and large packets transmit 
command messages, updating information, and data objects, 
respectively. Workstations exchange packets in order to 
maintain cache coherence. For example, if a client reads a 
data object from the sequencer, it takes: one small packet 
containing the read request sent from the client to the 
sequencer and one large packet containing the data object 
sent from the sequencer to the client. Since a broadcast 
sends packet to each workstation one by one, the 
communication traffic for the broadcast is sixteen times 
higher than the traffic for the basic packet transfer. 


If an access is a write, the write-update protocol 
updates the caches associated with all workstations [4]. 
Therefore, after a cold start, all reads in the steady-state are 
cache hits, and they execute locally with no communication 
traffic and have zero latency. Since all writes broadcast the 
updating information to all sixteen workstations, writes are 
always remote accesses. 


If an access is a write, the write-invalidate protocol 
updates only the local cache associated with the workstation 
that performs the write [4]. All other copies in the caches 
associated with other workstations are invalidated, by 
broadcasting the invalidation command message to all 
sixteen workstations. If there are no intervening accesses 
from other workstations, after the first write access all 
successive reads and writes from the same workstation hit 
to the only valid copy in the whole system. Therefore, these 
accesses are local, and they execute locally with no 
communication traffic and have zero latency. In addition, 
second and all successive reads from the same workstation, 
without intervening writes from other workstations, are also 
local. The first read or write, after a write from another 
workstation, is remote. If multiple workstations have valid 
copies, then the write is a remote access that broadcasts the 
invalidation command message to all workstations. 


III. ADAPTIVE CACHE COHERENCE PROTOCOLS 


Most decision functions are based on a sequence of 
consecutive accesses to a data object. Copies of a data 
object are updated until a certain number of consecutive 
writes is performed by the same workstation, such that no 
intervening accesses are performed by other workstations 
[1-2], after which the write-invalidate protocol is used. For 
example, we simulate the RWB adaptive protocol [1] whose 
decision function switches from the write-update protocol 
to the write-invalidate protocol after three successive writes 
from a workstation without intervening writes from other 
workstations. We also simulate the EDWP adaptive 
protocol [2]. The difference between the RWB protocol and 


the EDWP protocol is that not only write accesses from 
other workstations, but also the read accesses, are used to 
prevent switching from the write-update mechanism to the 
write-invalidate mechanism. Both adaptive protocols switch 
from the write-invalidate mechanism to the write-update 
mechanism on the first intervening access from another 
workstation. 


If there are no intervening accesses, then only one 
workstation accesses the data object. Therefore, it is better 
to use the write-invalidate protocol than the write-update 
protocol. The write-invalidate protocol sets the state of the 
local copy as the only valid copy on all workstations, and 
all successive reads and writes from the same workstation 
become local with zero traffic. If the write-update protocol 
is used, then the traffic is not zeroed and depends on the 
probability of write accesses, because each write access for 
the write-update protocol is remote. 


However, if there are intervening accesses from other 
workstations, the write-update protocol generates lower 
traffic than the write-invalidate protocol. If the interleaving 
of accesses from different workstations increases, the traffic 
generated by the write-update protocol does not change, 
because it depends only on the probability of write 
accesses. In contrast, the traffic generated by the write- 
invalidate protocol increases. After a write from one of the 
workstations, all other copies are invalidated. If a 
workstation hits to a copy that is invalidated, the whole 
valid copy should be transferred from the sequencer or 
another client (a client can also have the only valid copy, 
while the sequencer’s copy is invalidated). The traffic cost 
for transferring the copy could be high and it depends on 
the size of the data object. Therefore, the traffic for the 
write-invalidate protocol is significantly affected by the 
intervening accesses from other workstations. 


In addition, we also simulate an adaptive protocol 
called APCUM, whose decision function is based on the 
cumulative communication traffic [3], The decision 
function chooses between the write-invalidate protocol and 
the write-update protocol based on which mechanism incurs 
lower cumulative communication traffic. The decision 
function implements two counters for each data object, One 
counter keeps the cumulative traffic for the write-invalidate 
protocol, and another counter keeps the cumulative traffic 
for the write-update protocol, Although only one of the 
protocols (write-update or write-invalidate) is running at the 
time, both of these two counters are updated, To maintain 
the cumulative traffic, the appropriate communication costs 
are added to the counters for each read and write access. 
The APCUM protocol switches to the mechanism whose 
counter has a lower value. 


Results of the simulation and time domain analysis 
presented in the next section show that the previously 
described adaptive protocols may minimize both the traffic 
and the latency for some workload parameters, but there are 
workload parameters for which they will fail to minimize 
the access latency, 
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(a) Average traffic per access 


(b) Average access latency 


Figure 1: Simulation results 


IV. PERFORMANCE EVALUATION 


The analysis of real applications shows that the 
workstations perform accesses in bursts, i.e. only one 
workstation accesses the data object at a time [3]. 
Therefore, we use Dubois & Wang burst model [6] to 
define the artificial workload. We define that the artificial 
workload generates the accesses in bursts, where the burst 
size (number of accesses in access burst) is determined by a 
Normal (Gaussian) distribution (mean, standard 
deviation). When a workstation completes a burst of 
accesses, the probabilities of performing the next access 
burst are the same for all seventeen workstations. 


Figure 1 shows the results of simulation using the 
following workload parameters: the burst size is equal to 
N(11, 1). The average size of a burst, i.e. the number of 
accesses performed by a workstation without intervening 
accesses from other workstations, is equal to eleven. The 
standard deviation is small and equal to one, which means 
that in most of the access bursts there are eleven accesses. 
Both figures give the simulation results for different 
probabilities of write accesses, ranging from 0% to 100% 
(for example, 20% is denoted as 0.2 and 100% is denoted as 
1). We assume that workstations perform on average ten 
accesses per second. We present the graphs for the write- 
invalidate protocol (WI), for write-update protocol (WU), 
and all three adaptive protocols (RWB, EDWP, and 
APCUM). 


Figure 1a presents the average communication traffic 
per access in numbers of bytes. As discussed in the previous 
section, for large access bursts the traffic for the write- 
invalidate protocol is not significantly affected by the 
percentage of write accesses. Figure 1a confirms that the 
average traffic for the write-invalidate protocol does not 
change significantly when the probability of a write access 
is increased. Since each write is a remote access for the 
write-update protocol, the average traffic for the write- 
update protocol increases with the probability of a write 
access. If the probability of write accesses is greater than 


10%, the average traffic for the write-update protocol 
becomes larger than the average traffic for the write- 
invalidate protocol. Since all of the simulated decision 
functions minimize the average traffic, the average traffic 
for the adaptive protocols is similar to the minimal value 
between the average traffic for the write-update protocol 
and the average traffic for the write-invalidate protocol. 


Figure 15 presents the average access latency in 
milliseconds (ms). Although the average traffic for the 
write-update protocol is higher than the average traffic for 
the write-invalidate protocol for probabilities of writes 
between 10% and 90%, the average access latency for the 
write-update protocol is lower than the average access 
latency for the write-invalidate protocol. There are two 
reasons for this: 


1. The write-update protocol uses smaller packets per 
access than the write-invalidate protocol. Smaller 
packets use network bandwidth more efficiently. The 
write-update protocol uses only medium and small 
packets, while write-invalidate protocol uses large and 
small packets. 


2. The latency of a remote access for the write-update 
protocol is lower than the latency of a remote access 
for the write-invalidate protocol. To calculate the 
latency of a remote write access for the write-update 
protocol, we need to add the latency of a medium 
packet (updating information sent from a client to the 
sequencer) and the latency of a small packet (write 
permission command packet sent from the sequencer to 
a client). Different types of remote accesses for the 
write-invalidate protocol have different latencies. For 
example, if a client reads a valid copy from the 
sequencer, one small packet travels from the client to 
the sequencer (read request command message) and 
one large packet travels back (data object response 
packet). If a client reads a valid copy from another 
client, the latency is doubled, because the 
communication between two clients is performed 
through the sequencer: a read request message travels 
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from the requesting client to the sequencer (one small 
packet) a read request message travels from the 
gaquencer to the client that has the only valid copy (one 
small packet), the data object travels from the client 
that has the only valid copy to the sequencer (one large 
packet), and the data object travels from the soquencer 
to the requesting client (one large packet)’, 


However, ifthe percentage of writes is higher than 
QO%, the access atency for the write-update protece! 
Qurgrows the access latency for the write-invalidate 
protocol, Since each write is a remote access for the write- 
update protocol, the aceess latency for the write-update 
protocol increases with the probability ofa write access. In 
addition, the access Latency for the write-update protecal is 
imereased, because the increase in traffic Increases the 
average time that the packets wait in queues in order to be 
gene through Retwork. 


AS we Can see from the Figure 1, the adaptive protocols 
minimize doth traffic and access latency for percentages of 
writes that are ether less than 10% or greater than 90%. For 
the percentages of writes less than 10%, the adaptive 
protocels use the write-update protocol, which incurs lower 
waffic and lower access latency than the writeinvalidate 
protocel, For the percentages of writes higher than 90%, the 
adaptive protocols use the write-invalidate protocol, which 
incurs lower traffic and lower access atency than the write- 
update protocol. 


For the percentage of writes higher than 10% and lower 
tham 90%, the adaptive protocols minimize the traffic, but 
GQ mot mamimize the access latency. The adaptive protocols 
ws2 the writ-invalidate protece], which incurs lower traffic 
than the write-updare provocol. Since the writ-invalidate 
proaecel, the adaptive protocols incur high access latency. If 
the percemtage of write accesses is between G0%e and 80%, 
the access latency for adaptive protocols is even higher than 
the access latency for the pure write-invalidate protocol 
Dumng the first three accesses im 2 burst, the RWB and 
EDWP protocais use the write-update protocol, Since there 
are Be imterweming accesses from other workstations. after 
three successive write accesses from the same workstation 
the adapdve provocals switeh to the writinvalidare 
pretecel. Therefore, the first access im the next burst must 


* Wie cam estimate the values for the average ancess latency 
presenind em Figure | based om the packet semuice time (packer 
SZe im Dits exer Qamdwadth im bits per secend). The average 
latemey far the wie access fer the write~updane promeal is 
(QQBS5) (20M SO. los. Fer example. if 30% of accesses are 
writs tte average access latency is Q. Ins 2=0. 0d (Note that 
local witht zene latemcy).. Let ws assume that 100% of accesses are 
writs. Since te frst write access im 2 burst is remote and ail offer 
tem winhes are local wtih zero latemey (tere are 2] accesses in 
Quest), dte average access latemey for the writinvalidate protocol 
iS (2X72UB+2n4530(1 Dx 10M Sh ims. 


read the only valid copy from another workstation. Sin: 
the first access in the burst has the same latency as the m 
access of the write-invalidate protocol, and since there = 
two successive accesses that have the same latencies »- 
accesses for the write-update protocol, the average later: 
for the adaptive protocol is higher than the average later) 
for the pure write-invalidate protocol. 


Since it is not possible for the APCUM protocol! © 
maintain precise cumulative traffic counters, the adapi= 
protocol unnecessary switches between the write-inva!stc 
protocol and the write-update protocol, which increase: < 
average access latency. 


V. CONCLUSION 


The results of simulation presented in this paper > 
that it is not possible to design an adaptive cache cohemece 
protocol that minimizes both the average traffic per anzes 
and the average access latency. There are workload 
parameters for which the write-invalidate protocol 
minimizes the average traffic per access, while the wre- 
update protocol minimizes the average access latenc\ 
Since the analyzed decision functions switch between 
write-update protocol and the write-invalidate protoc: 
order to minimize the communication traffic, the acce: 
latency for the adaptive protocols may outgrow the avese 
access latency of both pure write-invalidate protoco! ar 
pure write-update protocol. 
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Coefficients’ Computation for Jot's 
Reverberation Algorithm 
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Fig. 3 Unitary feedback network 


For unitary feedback networks, the condition of preserving 
sound’s energy is assured by choosing a unitary matrix A: 


A.A’ =I. (2) 


Jot has completed the unitary feedback network, using the 
gain coefficients k;, responsible for the walls and air 
absorption of the sound. In this case, the equation takes the 
form: 


Y(z)=k-Z ™" -X(z)+A-k-Z™-¥(z) (3) 


ky 


where k= 


Introducing the coefficients k;, the poles of the unitary 
feedback network will move from the unit circle. It is 
interesting to place the new poles on another circle. This 
property is beneficent when thinking about ringing that 
could appear at certain frequencies, if some poles would 
have modulus closed to unit. In order to control the 
modulus of all poles in the same manner, it is possible to set 
the corresponding gains based on: 


aM 
[nee pT, (4) 
where / is the desired radius. 


Considering Jot’s structure, the gains will determine the 
reverberation time 7r concordant to: 


—3m.T 


k,=10 7 (5) 


The algorithm has been improved by replacing the 
coefficients k; with some low pass filters h,(z), simulating 
the frequency dependent absorption applied to sound. The 
correspondence between the reverberation time 7r and the 
absorptive filters is expressed in this case, by: 

lA (e Mies paced LY (6) 

Tr(@) 

Finally, the t(z) filter from figure 4, was introduced with the 
purpose of equalizing the transfer function, allowing to 
define a desired global frequency characteristic of the 


reverberator [2],[3]. Figure 4 presents the complete 
structure of Jot’s algorithm. 


Fig. 4 Jot’s reverberator 


III. COEFFICIENTS’ COMPUTATION 


The creating process of Jot’s reverberator could start from 
reproducing the early reflection part of the impulse 
response and then trying to tailor the decaying envelope of 
the late reverberation. 

Early reverberation of a rectangular room could be 
approximated analytically, for low frequencies at least, 
using the source image method [4]. Then, because the ear 
feels time proximate impulses within less than 1 ms, as a 
single vector composed impulse, early reflections pattern 
could be further simplified by combining adjacent taps of 
the impulse response, which are separated by less than Ims 
[4]. Considering the amplitude and time of such impulses 
elements, as t; and u;, the resulted pulse for two closed taps 
can be approximated using equations 7 and 8. 


2 2 
tip Uy +1in “Ujn 
bales rasgte anak tag (7) 
f 2 2 
Un TU 


Uy = Ju, EN ist é (8) 


Jot’s reverberator design expects to assign values to the 
parameters from the bloc diagram: [bj], [hi], A, [ci]. 
Instead of h; absorptive filters, a simplified structure could 
use, without significant differences, absorptive gains k;, in 
conjunction with a unique low pass filter, connected at the 
input of the feedback network. In this case, the cut 
frequency will be the average of the cut frequencies of all 
absorptive filters. 

A one pole low pass filter, defined by equations 9, is 
enough to simulate air absorption of sound [4]. 
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d is the mean distance swept by sound, corresponding to a 
proper amount of adjacent taps of the impulse response and 
Cis the sound speed in air. 

The absorptive gains k; will give the desired shape to the 
late reverberation, based on the equation 5, providing 
proper value of reverberation time. 

For testing the coefficients’ computation, a unitary matrix 
A was used (equation 10), which simulates the reflection on 
the adjacent walls. 

A 4x4 unitary matrix suggests a bidimensional propagation 
of sound in a rectangular room. 


OS tie al at, 
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(10) 
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For treating other examples it must be noticed that the main 
diagonal of A matrix corresponds to the situation when 
sound sweeps twice the distance between two consecutive 
reflections. This case corresponds to the location of the 
source and listener in the center of the room, so a non-zero 
main diagonal A matrix is too particular to be useful. 

In a first implementation, the other parameters of the 
reverberator [b;] and [c;] were computed considering the 
first 4 taps from the feedback network impulse response. 
A simple equation has to be-solved (equation 11),but it 
must be mentioned that [c;] parameters are considered all 
equal to a constant. 


Bek, = uy 
bycyk, = Uy 
; (11) 
b,c,k; = U, 
byCyk, = Uy 


A multiplication factor a has been introduced in order to 
correct the reverberation time, determined as [1]: 


4 
Tr =4-a-Yt,. (12) 
i=1 


The impulse response provided by the reverberator in this 
case is shown in figure 5. The simulated room was a 
10x10x6 m’ cube. 

Trying to reproduce the first 8 taps of the ideal early 
reflections, an 8x8 equations system could be written, based 
on the Z transform of the transfer function of Jot’s 
reverberator (equation 13). 


b-c-k =u(1: 4) 13) 
Ree DAcpe(@eaen) keno: 
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The equivalent equation system 14 proves that the system is 
incompatible. Anyway considering only the first 7 taps, 
equations 14 provide an infinity of solutions, separated by a 
multiplication factor. 
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Fig. 5 The impulse response provided by 
identifying the first 4 taps 
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Fig. 6 Impulse response provided by 
identifying the first 7 taps 
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For DSP implementation it is useful to have coefficients [5] 
smaller but closed to 1, so the greater element of vector [bj], 
respectively [c;] will be chosen 1-2°N , N being the word 
length for a specific processor. Then the other coefficients 
of the corresponding two vectors will be normalized. 

The impulse response for the resultant coefficients is shown 
in figure 6. 

A disadvantage that could rise when trying to give a 
specific shape to late reverberation, in the meaning of 
increasing reverberation time, is the possibility to obtain for 
[b;] and [c;] complex values. 


IV. CONCLUSIONS 


Using some simple computations we determined the 
parameters of Jot’s reverberator, by approximating the early 
reflections by the source image method and at the same 
time, setting the desired reverberation time as a frequency 
function. 

Some of the approximations that have been used at the 
design level do not affect significantly the reverberated 
sound perception and others represent convenient solutions 
regarding the cost of implementation (the source image 
method for impulse response generation, vector 
composition of adjacent taps, partial identification of 
impulse response taps). 

Using the proposed solution, the possibility of adjusting the 
reverberator parameters depending on the desired room 
characteristics and source-listener position, is offered. 
Another advantage of this algorithm is the improved echo 
density, which can be enhanced by choosing a proper 
dimension unitary matrix. 

Small matrices could also be combined with other methods, 
to improve late reverberation characteristics, as an 
alternative to larger unitary matrices. 

As future experiments, all pass filters or nested all pass 
filters will be cascaded with t(z) equalization filter as an 
alternative for echoes density enhancement. 
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Petri Net Based Modelling and Simulation of 
Email Alert System 


Juraj Puksec', Miljenko Opsenica’, Domagoj Frank’, Vlatka Maricic’, 
Kimon P. Valavanis”, Senior Member, IEEE, Ignac Lovrek*, Member, IEEE 


Abstract — The paper presents modelling, simulation and 
checking of Email Alert System service by using Petri nets. Petri 
net models are used in the early design phases for formal 
verification of the service solution characterised by concurrent 
processes and threads, as well as hardware parallel ability. Case 
study discussing call design approach is included. 


Index Terms — Internet service, email, Petri net, modelling. 


I. INTRODUCTION 


The integration of Internet services and Telephony services 
opens a new area in telecommunication for the development 
of services such as Email Alert System (EMA System). The 
EMA System service eliminates the need for permanent 
Internet connection. The. EMA System checks users 
mailbox on the mail server and once a new email has been 
detected, the user will be informed over the phone [1, 2]. 
The Email Alert System developed for implementing this 
service consists of seven modules. The communication in 
the EMA System is based on Component Object Modei 
(COM) technology, which enables global applications 
independence from programming languages and full 
network transparency. Problems related to such 
implementation are concurrency between processes, threads 
and hardware parallel ability. 

This paper deals with modelling, simulation and checking 
of the EMA System solutions by using Petri nets. The main 
goal of derived Petri net models has been to verify formally 
the proposed solutions, particularly their liveness property, 
in early phases of the project in order to avoid potential 
deadlocks and possible conflicts. 

The paper is organised as follows: Section II deals with a 
description of the EMA System. Voice Machine Module 
chosen as a case study is presented in Section III, while its 
Petri net model is given in Section IV. Section V discusses 
simulation results. Conclusions are given in the last section. 
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II. EMAIL ALERT SYSTEM 


One of the most popular Internet services is email, but its 
deficiency is the inavailability of service. User has to 
connect to the Internet to check for new email without 
knowing if any or some specific arrived. Also, it is not 
possible to access Internet at any time and at any place. The 
EMA System offers solutions for users by periodically 
scanning user’s mailbox on the mail server and informing 
users about arrived emails via telephone. 

A user of the EMA System can be anyone with a telephone 
subscription and an Internet accessible email account on a 
mail server that supports POP3 or IMAP4 protocol. 


Web Handler 


Figure I —- The EMA System in three layer structure 


The EMA System consists of seven modules: Web 
Interface, Console, Web Handler, Database, Controller, 
Mail Checker and Voice Machine (Fig. 1). The EMA 
System processes can be divided into three main layers: 
presentation, application and data layer. 

Users can register to the EMA System service via the Web 
Interface or via the Console. User data is stored in Database 
through the Web Handler. The Web Handler immediately 
sends an email to the user carrying his/her Personal 
Identification Number (PIN). When the user gets his/her 
PIN number he/she can activate the EMA System service. 
After the activation the Controller picks up user data from 
Database and requests from the Mail Checker to scan the 
user’s mailbox (or mailboxes). The Mail Checker sends a 
message to the Controller after detecting of new email. The 
Controller reads the user’s notification phone number and 
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sends a notification request to the Voice Machine. When 
the user receives the call from the Voice machine he/she 
must enter his/her PIN number. If the PIN number is correct 
a voice message will be played. Users also had to decide 
will the service stay active or will he/she deactivate it. 

The EMA System is implemented on a Win32 platform [3]. 
Modules were developed by using C, C++, HTML, ASP, 
Java Script and VB_ Script. The EMA System 
implementation includes Computer Telephony Integration 
(CTD, programming interface, concurrency control and 
synchronisation demands. Modules are independent and do 
not necessarily reside on the same host. Network 
transparency and differences in data representation are 
resolved by COM middleware technology [4]. The 
Controller communicates with other modules by using 
COM, except communication with the Database is through 
ActiveX Data Object (ADO) using SQL queries. 


III. CASE STUDY: VOICE MACHINE MODULE 


The Voice Machine acts as an interface between a user and 
the EMA System that uses a Public Switched Telephone 
Network (PSTN). 

The hardware part of the Voice Machine includes an 
integrated add-on board with voice processing capability, 
and an analogue or digital line interface. 

The software part of the Voice Machine handles requests 
from the Controller and initiates notification call towards 
service user, Figure 2. Notification call is state-machine 
controlled. If new email is detected, the Controller sends the 
request to the Voice machine to initiate an alert call by 
passing user’s UID (user identification number), telephone 
number and PIN. The Voice Machine starts alert call on the 


DIAL 
Dial State 


OFFHOOK 
Hook State 


IDLE 
Idle State 


ONHOOK 
Hook State 


PLAY_GOODBYE 
Play State 


PLAY_INTRO 
Play State 


FallBack 


PLAY_OFFSERVICE 
Play State 


concurrent processes 


Controller process Voice Machine _/chi] 


Job Distribution 
Thread 


MailChecker 

Database 
a 

[rh n] 


concurrent threads 


Figure 2 — Modules communication in the EMA System 


available channel by triggering the transition from stat 
IDLE to OFFHOOK, in the Call State diagram, Figure 3. | 
all channels are busy, the request will wait till timeot 
occurs. Call State machine controls voice processing, ca 
establishment, call progress, user keypad input and billin 
information collection from the telephone exchange. UIL 
Call outcome (successful notification, user busy, mn 
answered etc.) and service activity flag (if service is activ 
or not) is then sent back to the Controller. 


ITV. DESIGN OF CALL STATE DIAGRAM 


The Voice Machine Call State diagram is shown in Figur 
3. Call State diagram contains 14 states divided in thre 
main classes: Hook, Play and Digit plus Idle and FallBack. 
Request from the Controller initiates transition from IDL 


DIG_PIN PLAY_INVPIN 
Play State 


Digit State 


PLAY_CONFPIN 
Play State 


PLAY_SERVICE 


Play State 


PLAY_INVSERVICE 


DIG_SERVICE 
Play State | 


Digit State 


Figure 3 — Call State Diagram 
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to OFFHOOK state which starts the call by setting the 
telephone line “off hook”. If the operation is successful, 
state machine transits to DIAL which initiates dialling of 
user’s extension number. If the call is answered, 
PLAY_INTRO state plays the voice message which 
requests PIN to be entered by using keypad. DIG_PIN state 
collects keypad entries and tests if the correct PIN is 
entered. If the user provides correct PIN, confirmation 
message is played by PLAY_CONFPIN state. Otherwise, 
PLAY_INVPIN plays rejection message. If the correct PIN 
is entered, the notification is considered successful. After 
PIN is checked, user can deactivate the service or leave it 
activated by providing appropriate keypad input. Input is 
tested in similar manner as PIN input. If user decides to 
stop the service, PLAY_OFFSERVICE notifies that the 
request is processed. At the end, PLAY_GOODBYE greets 
the user and ONHOOK ends the call. The state machine is 
put to IDLE state which sends the call outcome to the 
Controller. 

In order to minimise the number of states, all events that do 
not pertain to “regular” call sequence initiate transition to 
FallBack state. Those events are the following: unanswered 
or busy call, user hang-up during call, timer exceeded, too 
many retries and any error or unexpected event. 


P3 - OFFHOOK 


P5 - DIAL 


ty @ 
P4 - if some error occu! a P6 - 
6 


Pi - IDLE 


P23 - if some e 
some error occurs 


P24 \ PLAY_GOODBYE 


t23 


P7 - PLAY_INTRO 


if some error occur: 


P22 - PLAY_OFFSERVICE 


P21 - deactivation of service 


V. PETRI NET BASED MODELLING AND SIMULATION 


In order to verify the module formally, call state diagram is 
transformed into Petri net [5, 6, 7], Figure 4. The main goal 
of derived Petri net models has been checking the formal 
properties of the proposed solution, particularly liveness, in 
early phases of the project in order to avoid potential 
deadlocks and conflict activities. 

Petri net is constructed by transforming the Call State 
machine diagram. All states are transformed into places and 
they are connected with transitions. Inhibition arcs were 
used for modelling error conditions. This is explained by 
the fact that from every regular place transition is possible 
to next regular place but if some error occurs the call drops 
to the FallBack place. 

Visual Object Net ++ [8] tool has been used for testing the 
EMA System modules. Different situations are investigated 
by simulating Voice Machine operations. The most 
important test cases defining successful and unsuccessful 
call are described by transition sequences and short 
explanations, as follows: 


P12 - PLAY_INVPIN "10 


utd P11 - invalid PIN number 


17 


P15 - PLAY_SERVICE 


P17 - DIG_SERVIC. 
P18 - if some error occurs fe) 


wes 


P20 - PLAY_INVSERVICE 


Figure 4 — Petri net of the Voice Machine module 
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1) Regular call flow 


Regular call flow means that user enters his/her PIN and 
SERVICE number correctly. The following transition 
sequence describes regular call flow: 


pi p3e ops py apo Ig 
pis—!!5_,py7—123_, pog_124_,po7_127_, py. 


The call is considered as successful. 


2) Call flow with errors 


All errors happening during the call initiate the call release 
and the call drops to the FallBack state. An example 
showing error during notification call, before PIN entering, 
(P80) produces the following transition sequence: 


pops 2 ops_© ppp 8 gpg 459727 


The call is considered as unsuccessful. 


3) User enters invalid service number 


User enters invalid service number (P19#0). After entering 
the correct number, the call is terminated regularly. The 
corresponding transition sequence follows: 

pis pgs pp Ss pes po SES pigs Spats, 


P17,PI9—_, poop 2, py7_—23_, pag 124, py7_127_, py 


The call is considered as successful. 


4) Service deactivation 


Users decision to deactivate the service (P2140) initiates 
the following transition sequence: 


pi—!_,p3_8 ,p5_ py 217 po 19 pig 13 gS 
t17 t22 t24 t27 


P17,P21——> P22 P24 5 P27__>P. 


IDLE condition is restored. 


5) User enters invalid PIN 


User enters invalid PIN number (P1140). After PIN 
correction user decide to deactivate the service. The 
following transition sequence is produced: 


pi—_,p3_8_,ps_5_,p7_t7_.p9 py 10 pit pg 9 


3 us tl7 t22 24 t27 


P13———»P15——>> P17, P21 ——» P22 <<» p2g__*_, pp 127. py 


The call is considered as successful. 


There are no deadlocks in the model. Conflict transitions 
are only transitions that model regular call continuation and 
error-initiated call termination, such as: (t1, t2), (t3, t4), (t5, 
t6), (t7, t8), (t9, t10, t12), (t13, t14), (t15, t16), (t17, t18, 
t20, t23), (t21, t22) and (t24, t25). All these conflicts are 
intentional. They describe mutually exclusive actions. 

The Petri net proves to be live, and there are no deadlocks 
in the model. 


From the simulation results we can conclude that the 
constructed Petri net is error-free. Only after the Petri net 
simulation was performed, the programming of Voice 
Machine module took place. 


CONCLUSION 


Modelling, simulation and checking of Email Alert Internet 
services by using Petri nets is described in the paper. Call 
model for Voice Machine and its Petri net model are 
elaborated as a case study. Petri net models were used for 
clarifying service concepts and implementation. Formal 
verification and simulation based on derived Petri net 
proved to be useful in early design phases. Petri net models 
and the analysis based on them were found to be an 
efficient way to improve design process and project 
controllability. This is important especially for new telecom 
services combining features from different networks, such 
as Internet and telephone network. 
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On the Benefits of Kernel Extraction 
During Logic Optimization 


Riccardo Scarsi, Enrico Macii, Member, IEEE 


Abstract—Kernel extraction during logic minimization, 
although beneficial in most cases, may not be applicable 
to certain classes of functions due to the restructuring 
effects this operation may have on the Boolean network 
being optimized. In this paper, we identify and 
characterize a class of logic networks for which “blind” 
kernel extraction does not pay off in terms of both 
power and area minimization. More specifically, we 
provide experimental evidence that arithmetic 
operations are negatively affected by kernel extraction. 


Index terms—Logic synthesis, digital circuit 


I. INTRODUCTION 


Kernel extraction is an algebraic operation that is normally 
applied to Boolean functions in the process of logic 
optimization. In particular, it is exploited, with different 
flavors, by the most efficient scripts distributed along with 
logic optimization tools. 


Unfortunately, kernel extraction does not always produce 
beneficial effects on the modified Boolean network. There 
are cases in which avoiding, or at least applying with care, 
this operation would be preferable. In this paper, we 
identify and characterize a class of logic networks for which 
“blind” kernel extraction does not pay off in terms of both 
power and area minimization. 


We start our analysis by considering simple exclusive-or 
(XOR) functions; then, we extend it to the more general 
case of arithmetic functions such as addition and 
multiplication. We limit our analysis to the case of single- 
output functions; this is because we would like to prevent 
logic sharing to influence the results of our investigation. 


The rest of the paper is organized as follows. Section II 
introduces the concept of kernel of a Boolean function, and 
summarizes how kernel extraction can be done. Section III 
presents experimental data regarding the impact of kernel 
extraction on power consumption and area of the optimized 
network. Finally, Section IV concludes the manuscript. 


The authors are with Politecnico di Torino, Dip. di Automatica e 
Informatica, Torino, ITALY 10129. 


II. KERNEL EXTRACTION 


The extraction of common sub-expressions in a Boolean 
formula relies on the search of common divisors of two or 
more expressions. If one is found, then the divisor can be 
extracted to represent a new node of the Boolean network 
and a corresponding variable can be used to simplify the 
original expressions. 


The notion of kernel of a Boolean expression, introduced by 
Brayton and McMullen in [1], has an important role for 


multi-level logic optimization. 


A Boolean expression is said to be cube-free if it can not be 
factored as a product of a cube and another expression. 


A kernel of an expression is a cube-free quotient of the 
expression divided by a cube, which is called the co-kernel 


of the expression. 


As an example of kernel extraction, consider the expression: 


f =ace+bce+de+ g 
Dividing f by e yields: 


ip ney ae bce+d 


which is cube free. Hence ac+bc+b isa kernel of f 
and @ is the corresponding co-kernel. 


Similarly, dividing f by ce yields: 
ae ae Maur b 


which is cube-free. Thus, a + b is another kernel of f and 
cé is the corresponding co-kernel. 


In power and area optimization, the value of a kernel is 
defined as the reduction in the number of literals in the sum- 
of-product (SOP) representation of the network when the 
kernel is extracted. 
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For power and area optimization procedures are available 
for extraction of multiple-cube common divisors from the 
network and for re-expressing the network in terms of such 
cubes in order to reduce the total number of SOP literals. 
Kernel extraction is iterated until no divisor whose value 
exceeds a given threshold exists. 


Ill. EXPERIMENTAL ANALYSIS 


We first consider a set of generic functions (taken from a 
combinational control circuit implemented with sparse 
logic). We apply kernel extraction to each function 
(considered in isolation) through the S/S [2] command: 


gkx -t threshold 


Parameter threshold indicates the value of the kernels to 
be considered in the optimization loop. In other words, only 
kernels with values greater than threshold are exploited. 
In our experiments, we pick increasing values of 
threshold (increment step of 5). The so-obtained 
technology-independent networks are then mapped for 
minimum area onto a very simple technology library 
containing only two-input gates using command: map -m 
0. Finally, we collect circuit area and power dissipation data 
using S/S and IRSIM [3], respectively. 


The results of our analysis are pictorially shown (power data 
only, normalized with respect to the value obtained with 
threshold = 0 — area results follow similar curves) in the 
diagram of Figure 1, and they indicate the beneficial effects 
of extraction: The smaller the value of threshold, the 
lower the power (and the area). 


We now focus on XOR functions with 6, 8, 10 and 12 
inputs, and we run the same experiment. As shown in the 
diagram of Figure 2, conversely from what happens for the 
generic functions, power (and area) decrease almost 
monotonically as the value of threshold increases, 
indicating that kernel extraction negatively impacts these 
two parameters of the design space (the best result is 
obtained with threshold = o, that is, no kernel is 
extracted). 


As expected, this behavior is not peculiar of XOR functions; 
it extends to more general arithmetic operations, such as 
addition and multiplication. Consider, for example, the 
various Output signals of an 8-by-8 multiplier and of a 16- 
bit adder. The corresponding normalized power dissipation 
diagrams are depicted in Figures 3 and 4, respectively. 


In the case of the multiplier circuit, the negative influence of 
kernel extraction is even more emphasized than for the 
XOR's. On the other hand, the curves of the adder circuit 
are sometimes a bit more “noisy”, due to the fact that some 
output bits are characterized by functions that have a less 
regular (i.e., more sparse) topological structure. 


The general conclusion we can draw from our analysis is 
that, although kernel extraction has great potential for logic 
optimization in most cases, when the target is the 
implementation of arithmetic units, commonly adopted 
synthesis scripts should no longer be used; rather, a careful 
selection of each optimization step should be carried out in 
order to achieve the best results. 


IV. CONCLUSION 


Kernel extraction during logic minimization, although 
beneficial in most cases, may not be applicable to certain 
classes of functions due to the restructuring effects it has on 
the Boolean network being optimized. 


In this paper, we have identified a class of circuits, namely, 
those implementing arithmetic operations, that are 
negatively sensitive to kernel extraction. We have proved 
that, for this kind of circuits, kernel extraction should be 
performed with care when the ultimate goal is power and 
area optimization. 
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Figure 1: Power Dissipation of Generic Functions for Varying Values of threshold. 


Figure 2: Power Dissipation of XOR Functions for Varying Values of threshold. 
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Figure 3: Power dissipation of a 8-by-8 Multiplier for Varying Values of threshold. 


Figure 4: Power Dissipation of a 16-bit Adder for Varying Values of threshold. 
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Efficient Service Management Systems In 
Future Open Competitive Communication 
Systems 


A. Taskaris, P. Demestichas, E. Tzifa, M. Theologou 


Abstract—Efficient management will be a key factor for the 
future open competitive communications systems. The 
separation of service providers and connectivity providers 
requires the optimum usage of network resources. This 
separation of roles allows service providers to offer services 
with different QoS and to charge their services according to 
the QoS they require. In this presentation a solution for the 
resource reservation depending on users’ requests is given. 
Index Terms—TMN, QoS, SQL, AQL, ORB, Service 
Provider, Connectivity Provider 


I. INTRODUCTION 


In the prospect of a free market, it is a matter of time that 
connectivity providers will not be providing services to 
users exclusively. Various providers might offer services to 
users by renting the resources from connectivity providers. 
Service providers inorder to work successfully and to be 
able to afford competition have to efficiently operate the 
services that users request from them. This means that they 
utilize the resources rented by connectivity providers in a 
way that not only satisfies their users’ needs, but also is the 
most financially profitable to them. The profits of a service 
provider depend strongly on the cost of the resources rented 
from the connectivity providers. 

In order to achieve their goals, that is to satisfy users’ needs 
and to reach the highest possible profit, service providers 
use design tools [1] and service management systems for 
the resources they rent from connectivity providers. 

This paper presents general concepts related to service 
management, as well as related systems and problems. In 
the first section a brief description of the 
Telecommunication Management Network (TMN) 
architecture is given [2,3,4,5,6,7,8,9,10]. In the second 
section a description of the service management system is 
given, and pertinent definitions are given. In the third 
section the resource reservation problem in the context of 
service management is stated in a high level, in a 
mathematical manner and solved. In the final section the 
conclusions of our survey are given. 
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II. TMN COMPONENTS 


In this section we briefly outline the TMN concepts that 
relevant to the rest of the paper. The TMN architect 
defines entities and the communication between them fi 
the functional, information and physical points of view. ’ 
different management layers of the TMN architecture 
starting from the higher level: business, service, netw 
and element. Each layer comprises components that r 
fall in the configuration, fault, accounting, performar 
and security management functional areas. Th 
components may be used for building more comp 
management services. 

A brief description of the TMN architecture is given 
Figure 1 


Configuration | Fault | Accouuing | Performance | Security 


Business Management 
Layer (BML - OSF) 


Service Management 
Layer (SML - OSF) 


Network Management 
Layer (NML - OSF) 


Element Management 
Layer (EML - OSF) 


Figure 1: TMN description 


Our study will be focused on the service and the netw 
layer. The need for QoS offered to users and the need 

configuring the network and handling network trai 
efficiently, are the reasons that make those two Th 
elements particularly interesting for our study. As it can 
seen from Figure 1, the service layer of the Th 
architecture sends its requests to the network layer. 

Each user that requires services from its service provi 
increases the demand for network resources and alters 
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network traffic. The service provider, in order to satisfy its 
users’ needs, has to demand from the network layer the 
appropriate resources. 

In the network layer, the appropriate resources have to be 
allocated, in order to be able to satisfy service providers’ 
demands. This might imply the reconfiguration of network 
resources. 


Il]. SERVICE MANAGEMENT ISSUES 


In this part of the document, a brief description of the issues 
that are related to the service management layer is given, as 
they are defined from the TMN standards. A description of 
the interface with the underlying layer (network layer) is 
given, as well. 

The service management layer, according to TMN 
specifications, is responsible for service ordering and 
provisioning of customers, for dealing with billing matters, 
for administrating customers, for keeping customer’s 
information and for managing workforce. Our approach of 
service management offers a more flexible way of ordering 
services to users. They have the ability to request services 
with a specific QoS and they will be charged according to 
the QoS that they request. 

The service management layer accepts requests from users 
that want network connections with various QoSs and sends 
to network layer requests for allocating the necessary 
resources. The requests concerning the network layer are 
made through an Object Request Broker (ORB), which is a 
platform that accepts and forewords requests from one layer 
to another. By means of the ORB platform service layer and 
network layer are completely independent. The ORB 
interface between TMN layers simplifies their 
communication. The ORB architecture [11,12] allows two 
layers to operate as separate objects, to communicate with 
each other without knowing the internal processes of the 
other object. It also allows programs to operate on objects 
without knowing the internal operations of the other object. 

The network layer communicates with the transport 
network, which is the iowest part, in our network 
architecture, for the physical allocation of resources. If the 
resources are available they are reserved for the particular 
service provider. The reservation of new resources demands 
reconfiguration of the network. In order to offer high 
quality services, the evaluation of new traffic and the 
changes that are necessary should take place in an 
acceptable time limit. A brief description of the service 
architecture is given in Figure 2. 

Our study is focused on the service ordering part of service 
management layer. The architecture that was used in our 
study could be also used for handling problems that have to 
do with any of the other sectors of service management 
layer. For example, it could be used for mapping customers 
and payment methods when users demand various ways of 
payment, depending on the services they order or on the 
amount of their expenses, 

The chosen architecture was based on the use of an ORB 
platform as the interface between the various network 


management layers. This is a good solution for network 
management, because the various layers could be 
completely independent, keeping no information about 
other layers. It also makes interoperability easy, which is a 
very important factor in managing networks containing lots 
of different elements. 


Service Provider 


Service Management Layer 


Object Request Broker 


Network Management Layer 
Transport Network 


Connectivity Provider 


aoe 


Figure 2: Service Layer-Network Layer 


IV. RESOURCE RESERVATION 


The problem that arises with the separation of connectivity 
providers and service providers deals with finding the 
appropriate mapping between them and between sessions 
demanding the same QoS and connectivity providers, in 
order for the service providers to work successfully. The 
parameters of mapping service providers to connectivity 
providers are mainly the benefit that each service provider 
has when using resources from a specific connectivity 
provider and the cost of the reservation of resources. 


A. Problem description 


The QoS that users require and the correct rate policy of the 
resources is the main concern of service providers. Each 
service provider in order to efficiently utilize the rented 
resources, has to allocate them from various connectivity 
providers, according to the QoS of the sessions that users 
request. Sessions that require higher QoS and which are 
more expensive have to be serviced from the resources that 
can offer such QoS and the sessions that require a lower 
QoS could be serviced from the cheaper resources. By using 
this strategy users pay according to the QoS they require, in 
other words, according to the quantity and the cost of the 
resources that must be allocated whenever they are using the 
network. 

Each user belongs to a certain Service Quality Level (SOL) 
according to the QoS he desires and each SQL can be 
serviced from one or more Application Quality Levels 
(AQL). Sessions belonging to an SQL could be serviced 
from various connectivity providers, depending on the QoS 
that each connectivity provider offers. In others words, a 
mapping has to be made between the SQLs and the AQLs 
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and between the sessions of an SQL and the various 
connectivity providers. The optimal mapping could be 
reached by taking into consideration the cost of allocating 
resources, the benefit that a user may have when using a 
particular resource or the quantity of resources that have to 
be allocated. 


B. Problem formulation 


The mathematical description of the problem mentioned on 
the previous paragraph is given in the following section: 

Let S be the set of Service Quality Levels (SQLs), A be the 
set of Application Quality Levels (AQLs), A(i) be the AOL 
to which a session can be assigned; s(i) be the number of 
sessions of type i which should be served, bs,(i,j) be the 
benefit from assigning SQL-i to AQL-j, rs,(ij) be the 
resources needed if SOL-i is assigned to AQL-/, pc(r) be the 
cost for occupying r resources from connectivity provider C 
and C be the set of Connectivity Providers. 

Find an assignment of SQLs to AQLs, A, and an assignment 
of SQL sessions to Connectivity Providers, B, that optimizes 
the objective function: f(A, B) 

The objective function f(A,B) is related to the cost of the 
assignment of SQLs to AQLs and to the cost of the 
assignment of sessions to Connectivity Providers. 

Supposing that each SQL is assigned to a single AQL, that 
all sessions of each SQL should be served from a 
Connectivity Provider, that sessions of type / served from 
Connectivity Providers have a certain cost, the resources 
that are reserved from a Connectivity Provider are 
calculated. 

A formulation of the mathematical problem could be the 
following: 

The decision variables are the following: 

rs) = if SQL-i is assigned to AQL-j, 0 otherwise 


Vac lOrirk) =1 if k units of session type i are serviced from 
RC 


Connectivity Provider C, 0 otherwise 
The objective function that should be maximized is: 
2 ys Xu D by GI- » De rac(©)) 
ieS jeA(i) ceC 
The conditions that should be satisfied are: 
Vx G/=aLieS 


jeA(i) 


Di S.(c,i) =s(i),ieS 


ceC 


Dard) = Sy, (ik) ek 
k=l 
Pel) = Sach A) VXsulbA* rlb jk, VeeCc 


JeA(i) 


C. Solution methods 


The problems of mapping SQLs to AQLs and SQL sessions 
to connectivity providers, are exponentially increasing and 
therefore the use of computationally efficient heuristic 
algorithms is necessary [13,14,15]. Only in cases that the 
number of SQLs, AQLs, SOL sessions and connectivity 
providers is small, an exhaustive search can be used in 


order to find a solution. Otherwise the response time of the 
exhaustive search is impractical and the reconfiguration of 
the network cannot be done quickly and efficiently. In our 
study we used the Taboo algorithm [16,17] and the 
Simulated Annealing algorithm [18,19]. By means of these 
algorithms a near optimal solution could be reached within 
a polynomial time. The results were satisfying, compared to 
those of the exhaustive search, and their advantage is that 
they reach a solution quicker and they require less 
computational power. These advantages are vital for the 
network management, because a quick response of the 
algorithm means quicker management reconfiguration, 
therefore better network utilization and less rejected 
requests, and the less computational power implies 
decreased expenses for network management hardware. 

By means of the Taboo algorithm, the best solution 
according to certain criteria is chosen in each step, but the 
rest of the solutions are not rejected. They are stored in a set 
of possible solutions. In the following steps not only the 
best solution of the previous step is taken into 
consideration, but the other possible solutions, as well. By 
using this method the local optima can be avoided and it is 
more possible to reach the best solution. 

According to the Simulated Annealing algorithm, not only 
the best solution of each step is kept for the following steps, 
but some other solutions are kept for the calculation of the 
following steps. The criteria of keeping these solutions is 
the value of a possibility function. In each step of the 
algorithm not only the best solution of the previous step is 
calculated, but also the solutions that have been kept are 
taken into consideration. By means of this method local 
optima can be avoided and the best solution could be easily 
reached. 

The criteria for choosing the best solution are the benefit 
that a service provider might have by allocating resources 
from a connectivity provider, the cost of the resources that 
the connectivity provider offers, the number of the SQLs, 
the number of AQLs, the number of sessions per SQL and 
the number of connectivity providers. 


V. CONCLUSIONS 


In our study we deal with the problem that rises when 
service providers rent their resources from various 
connectivity providers and they offer services to users that 
are rated according to the QoS they require. The parameters 
that are included are the number of SQLs, the number of 
AQLs, the number of sessions per SQL, the number of 
connectivity providers, the cost of allocating resources and 
the benefit that a user has when allocating certain resources. 
In the network architecture on ORB platform has been used, 
in order to make the interface between the service and the 
network management layer completely independent and to 
make interoperability easier to implement. By means of this 
network architecture users can request services of certain 
QoS, service providers are able to offer such services by 
selling them in various prices, depending on the QoS they 
require and they have the ability to manage the resources 
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they rent from connectivity providers more quickly and 


efficiently. 
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Automatic Equipment for Final Testing of 
Telephone Pair Gain Systems 


Andrej Levstek, Member, IEEE, Savin Gorup, Marko Jankovec, Zarko Gorup, Member, IEEE 


Abstract—This paper describes a practical solution of final 
testing of Digital Subscriber Pair Gain (DSPG) systems. These 
devices help to overcome the last-mile bottleneck of tele- 
communication networks. DSPG systems must comply with 
rigorous regulations of national communication administrations 
to be permitted in public use. The architecture of the system 
consists of a custom measuring unit, switching network and a 
supervisory PC. An artificial telephone line has been developed 
to simulate UTP cables in the length of 3.5 and 4 km. Total 
harmonic distortion (THD) of the ringing signal is measured 
with a microcontroller using a method that is explained in detail. 


Index Terms—telephone measurements, artificial line, pair gain 
system, THD, UTP, ISDN, HDSL 


I. INTRODUCTION 


Final testing of sophisticated electronic equipment is an 
important issue of providing the high degree of quality 
required by the market. Digital Subscriber Pair Gain (DSPG) 
systems have to comply with numerous requirements imposed 
by national telecommunication administrations. The 
equipment should be operating over an extended temperature 
range since the remote terminals are usually installed in 
outdoor cabinets. Pair gain systems are used to increase the 
number of telephone channels being transmitted over existing 
unshielded twisted-pairs (UTP) formerly used for analog local 
loops. High-bit-rate digital subscriber loop (HDSL) is used 
for the transmission of multiple analog subscriber loops over 
a UTP line as depicted in Fig. 1. 


Subscriber 


terminal terminal [f 


hOSDOSDODOOCOOCOSOOH 


ST 


Subscriber 
telephones 


Telephone exchange 


Fig. 1. Pair gain system for analog subscriber local loop 


The analog telephone loop is usually referenced as plain old 
telephone service (POTS). With the expanded demand for 
ISDN lines a new DSPG system for digital local loop (DLL) 
for BRA has been developed. The glossary concerning these 
systems is slightly different and is revealed in Fig. 2. Reliable 
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Fig. 2. Pair gain system for ISDN over U-interface 


testing of ET-ST (COT-RT in the ISDN case) couples 
involves numerous measurements that require special and 
expensive equipment. The testing of large quantities of 
finalized equipment imposes long hours of tedious work to 
relatively high skilled staff that inevitably leads to fatigue and 
mistakes. As soon as large production volumes are expected 
the development of special automated testing and measuring 
equipment is justified. The benefits of computer controlled 
quality inspection are numerous but the most important one is 
to maintain the production output flawless at least at the 
moment of shipping. 


II. SYSTEM ARCHITECTURE 


A. General Design Requirements 


The described test equipment hardware and software was 

developed to control the production of recently developed 12 

channel POTS and 5 channel ISDN-BRA DSPG systems. The 

same equipment should accommodate both systems fulfilling 
the following main requirements: 

1. The hardware should be capable of testing up to 56 
systems with the possibility of reducing this number. In 
every configuration the number of inserted pair gain 
couples should be detected. A tiny version of the testing 
device should be considered too. 

2. COT and RT of each couple should be connected via an 
artificial transmission line (ATL) simulating up to 4 km 
of UTP. Attenuation of the line has to be adjustable in 
order to imitate the most frequent gauges of UTPs. 

3. After devices under test are inserted (plugged in) 
manually the testing device performs all the tests 
automatically without human surveillance. 

4. The testing procedure is. controlled by a personal 
computer (PC). Measuring results are collected and 
stored on the hard disk. Brief results and progress of 
testing should be shown on line on the screen. Exhaustive 
reports in appropriate printed as well as electronic 
formats should be available. 
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B. Measurements Specific to DSP'G Systems for POTS 

On every synchronized set of UT and ST for transmission of 
12 analog subscriber loops numerous tests both on channel 
and unit level have to be carried out, Most of the tests are 
initiated by the custom design equipment at one terminal and 
measured at the corresponding channel of the other device, 
Due to limited space only a brief list of required 
measurements and tests per channel is specified bellow, 


1, Delay and distortion of dialing pulses 

2, Amplitude, frequency and 77//D of ring, voltage 

}, Frequency response in both directions 

4, Gain variation with level in both directions 

5. Magnitude of loop DC currents on both sides 

6, Functional test of the PORTA signaling system 

7, Detection (ST) of the inversion of the DC feeding voltage 
polarity (HT) 

8 Transmission of CLIP signaling » 
calling telephone number 

9, Amplitude and timing distortion of tariff tone pulses 


identification of the 


On the terminal level the following properties of each couple 

of connected devices has to be verified; 

1, Metallic bypass on the first channel via the inter 
connecting copper pair in case of supply voltage failure 

2, Capability of the remote power feeding system for the 
worst case of loading of the subscriber terminal 

3, Performance of the communication port (S485) of the 
HT device that is used for internal supervision and 
control 


C. Required Tests of ISDN Devices 

For each channel of these devices the following tests and 

measurements have to be performed; 

|, Synchronization of the U interface and bit error rate 
(BER) test of the established connection, 

2, DC value of the feeding voltage at the U interface 
terminals of the RT device, Dynamic properties of the 
current limiting of power feeding at the same terminals, 


On the ISDN pair gain module level the same tests as above 
should be carried out, with the exception of the metallic 
bypass which is not implemented in these devices, 


D, Design considerations 


Based on the functional requirements, mechanical restrictions, 
recommendations for public telecommunication networks [4], 
[5] and overall flexibility of the design the architecture shown 
in Fig. 3 was determined, At this point we will focus only on 
main problems neglecting the less interesting but still 
important details such as powering of the testing system and 
the devices under test, The testing equipment for ISDN and 
analog DPG systems was designed as a single multipurpose 
testing appliance although the required measurements of each 
type of device are quite different, Both systems have the same 
HDSL interface, remote power feeding and controlling 
software, The same is also the layout of the connector of COT 
and ET devices that are mounted in appropriate racks, Similar 
rack cabinets are used for testing with the distinction that 
every second slot is occupied by ATL&SW modules as it is 


NTT 


| $7 
i Ww 
eel se 8 nr 
leanne . 
| i i 1 GOT re 
| f | f Dh | ISDN OO 
4 — -—- -—> SswitonW 
EMULATOR 


ET] AL | BT a a ET | Al 
or My or | 4 
COT] Bw iGo CON Bw 


i i ei 
co o 


Channel selecting ewitely be Obi 
Be de nsasa | A F 
Measuring module la We 247 
for analog channels ous 468 / 


ooverter 


Supervising PO 


Vig, 3, The main funetional units of the test system, ‘Tested 
devices are shown in gray, 

AL, & SW © Artificial Line & Module Switch 

HT xchange Terminal 

ST» Subseriber Terminal 

COT + Central Office Terminal 

R'» Remote Terminal 

shown in Vig, 3, There is also a certain possibility that mixed 
(POTS and ISDN) DSPG systems will be introduced in 
production, It would be very inconvenient to carry out the 
lesting of mixed devices by plugging them twice in two 
different machines, 


The system has been divided in six funetional units: 


contol and supervision 

artificial line(a) 

unit and channel selecting 

* measuring module for analog loop devices 
* computer controlled U interface NT 

* emulator of the ISDN central office awiteh 


Unit and channel selection is accomplished by eleetror 
mechanical relays, which have been chosen after thorough 
analyses Of all possible devices had been done, Being a 
mechanical device a relay has many drawbacks, Modern 
solid-state opfomos relays have excellent properties eg, 
insulation resistance, switching times and number of 
operations, but ordinary miniature relays offer higher denaity 
of changeover contacts and a lower cost per contact, The 
latter is of great importance because a large number of lines 
lias to be switched, ‘The relays, which connect the subseriber 
and CO lines on the measuring buss, are mounted on the same 
card as the artificial line (ALA&SW), 

The tests are activated and controlled by a PC using two 
standard RS»232 serial communications that are converted to 
RS-485 serial buss in order to communicate with the 
measuring and switching units of the system, The designed 
converter provides also galvanic separation of the computer, 
The network termination (NT) shown in Mig, 4 is a POMCLA 
device that is controlled by the testing program, It is used for 
BER tests of the selected channel in conjunction with the 
COS emulator, which is a standalone testing instrument, This 
test functionally verifies both 64 kb/s data channels (AL and 
B2) in each direction and the 16 kb/s signaling channel of the 
basic rate access (BRA), 
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Ree 
Fig. 4. Detailed diagram of testing system 
TIL ARTIERCIAL TRANSMISSION LINE 


A. HDSL 


HDSL is a transmission standard for fall-duplex conmme- 
nication over ordinary existing telephone loop UTP cables. It 
uses four-level pulse amplitude modulation (PAM) code, 
which is also used in DLL (BRA) This code is usually 
referred as “2B1Q”. The spectrum of HDSL signal depends 
on the bit rate. For the communication between the owe 
terminals of the DSPG system a flat power density spectrum 
up to 200kHz was assumed. Above this frequency the 
spectrum decays with more than 4Q0B ek. 


B Twisted Pair Cable Somadation 

_ The ATL designed as a passive Owo-port network with the 
frequency response matched to the real UTP line it the range 
up to 2 MHz. It has to be symmeitic since data are transferred 
in both directions. The device has to provide a path for the 
DC feeding current of remote units (RT, ST). In order to 
achieve a high level of flexibility of the system following 
additional features conceming ATL have been required: 
® two lengths of © 0.4 mm PE cable: 3.5 km and 4 kin 
© two values of feeding current resistance: 600 Q, LKQ 
® selectable decrease of overall attenuation for 6 dB. 


Fig. &. Primary parameters of © 04 mm PE cable. Vertical 
_ Seale is relative. 
_ The primary parameters of the mentioned UTP cable [3] ate 
shown in Fig. 5. The fourth parameter G’ is not appearing in 
the diagram since modem insulators do not exhibit any loss at 
ow frequencies. The main reason that 2° and L° vary with 
frequency is the “skin-effect”. Based on cable data and the 


lumped cirouit model ef a short section of the transmission 
tine [UL] the frequency response of the line of the length / 
assuming G" = 0 is found to be: 


A(@) =——+—., 1) 
( = ov ( 


and reflectance of the load 7} is defined with 2 and &, 


Riw)s jel ey 2 
&y = US IEH 2 AO and t". = & e (2) 
FOX Ry + Lo 
y(@) = (RY feb eC Q) 
The absolute value of f(@) for / = 3.5 kin is plotted tn Hig. 6. 
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Pig. & Calculated frequency response (1) of UTP tine and the 
response ofthe two-port cirouit of Mig. 7 (0 = 3.5 kin) 


For comparison on the same diagram the response of the 
synthesized network shown ia Mig. 7 is alse plotted, One can 
note quite good matching between response of the actual line 
and the simulating cireuit, The values of the elements have 
been optimised by numerical simulation. The resistors Aj 4 
and Xxo have been added to the basic RC ‘Tfour pole in 
order obtain the requited value of the DC resistance, The 
properties of the cireult are controlled by relay switches that 
are driven by a microcontroller (aC), The final version of the 
cireuit has been realized and tested, Measured frequency 
responses of tWo circuits are shown tn Fig. 8. Por frequencies 
below S00 KHe both circuits exhibit good agreement with the 
calculations, Due to stray inductance of the realized cirouit a 
resonance occurs, With some minor corrections of the PCA 
layout the characteristios has been significantly improved, If 
necessary the response can be improved if ceramic or other 
low stray inductance capacitors are used instead of metalized 
foil types. These capacitors have been used because of the 
high DC voltage on the HDSL, 


nn 


Fig. 7. Electrical scheme of the artificial telephone tine 
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Fig. 8. Measured frequency response of the original and 
improved circuit. Simulating line length is set for /= 3.5 km 


IV. MEASURING UNIT FOR ANALOG CHANNELS 


A. Hardware 


The heart of the testing system represents the measuring unit 
for analog channels. This unit mimics the exchange and the 
subscriber’s telephone. The main functional element of this 
unit is a uC that executes all tests concerning POTS. The pC 
is not appearing in the scheme (Fig. 9) to maintain it 
perceivable. All analog voltages and currents are conditioned, 
suitably detected and digitized by 10-bit ADC contained in 
the nC. The exception are the measurements of the voltages 
and currents in connection with ringing. The frequency of 
ringing is too low (25Hz) for efficient classical detection 
without unacceptable long delays. On the other hand 


PULSE SINE 


RING 
GENERATOR DETECTOR GENERATOR 


Fig. 9. Block diagram of measuring circuit for POTS units 


sampling and digital signal processing of low frequency 
signals are child’s play for contemporary controllers. 
Frequencies, distortions of pulse widths and time delays are 
measured by internal timers that are activated by 
corresponding digital inputs and/or internal clocks. The 
communication with the master PC, which controls the whole 
system, is effected by serial communication using the RS-485 
standard. Once a test is started the uC is left on its own until 
the task is finished. At that point the results of that particular 
measurement are returned. In this way no external interrupt 
can disturb the measuring procedure. 


B. Measurement of THD of Ringing Voltage 


Total harmonic distortion (THD) of the ringing voltage 
generated by ST should usually be kept below 10% [4]. The 
algorithm of this measurement is implemented in the software 
of the uC. At the beginning the exact period of the ringing 
signal is determined. Then one period of the incoming signal 
up(t) is sampled in M points and stored in memory. After all 
the samples are stored the discrete signal x(n) is computed by 
subtraction of the mean 


x(n) = u(n) — ¥ =u(n) - 4 > un) (4) 


The power of distortion Pp is calculated using the Parseval’s 
theorem [2], therefore only the power of the signal P and the 
amplitude c, of the first harmonics are required: 

M-1 


1 
P = xays =— >) ¥'(n) (5) 
M j20 
guide! 20 pines) 274 
=— cos — =— x(n)sin—n (6 
aap Let eosae” bu= a7 25 t) ea 
2 Isvo 1 


1 
Pp = XRus — 54 = Xeus 5 (ar +B) (7) 


SED: 
THD = 2 (8) 


The calculations of (4), (5) and (6) are effectuated by the nC 
of the measuring unit. The expressions for the sine and cosine 
component of the first harmonic amplitude are implemented 
by multiplying x() by 16 different stored values representing 
the harmonic functions. The rest is computed in the main 
supervising program that runs on the PC. The whole method 
requires only three intermediate results to be send, namely ay, 
b; and P. The division by M in (5) and (6) has been simplified 
to merely a shift to the left for six bits by selecting M = 64. 
An analysis of the accuracy of the described method can be 
found in [6] 


V. CONCLUSION 


The described testing system has been realized and tested. 
The flexibility of the artificial telephone line has turned out to 
be very useful. The variable length and attenuation help to 
categorize the devices regarding the length line. The 
supervising program has undergone some minor changes 
mostly due to desires that came out during practical use. 
Different relative signal levels in different countries will 
require an improvement of the analog channels. 
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A NEW WDM PROTOCOL: 
ARCHITECTURE AND PERFORMANCE 
ANALYSIS 


Ioannis E. Pountourakis 


Abstract— In this paper we study a single-hope network for a 
passive star topology and suggest a Multichannel Control 
Architecture (MCA). The proposed WDM network with an 
appropriate Network Interface Unit (NIU) at each station 
gives the solution to the problem of the electronic processing 
bottleneck that the single common shared control channel for 
pretransmission coordination introduces [1]. The novel 
concept of MCA and the distribution of control information 
over a number control channels, in conjunction with a suitable 
NIU minimize the headers processing requirement at each 
station [2]. An analytic model is developed and analyzed for 
performance measures for finite population. The effect of 
Receiver Collision is analyzed and estimated the average 
rejection probabilities at destination. 


Index Terms—wavelength division multiple access WDMA, 
Multichannel Control Architecture MCA, Network Interface 
Unit NIU, electronic processing bottleneck, receiver collision. 


I. NETWORK ARCHITECTURE MODEL AND ASSUMPTIONS 


The system under consideration as Figure 1 shows is a 
wavelength division multiplexed broadcast passive star 
network. The bandwidth of the optical fiber is divided into 
v+N channels each using a _ different wavelength, 


Agcy AaoAgy, to serve a finite number N of 
stations. Each station is assigned with a fixed wavelength 
A, {i =1,2...N} for transmitting data packets. For 
propose of signaling we introduce the MCA at 
wavelengths 1.,,...,4,, which is shared by all stations for 


coordination of data packet transmission. The network 
system can be described as a [cc ]” - 77 — FR” —7R . It 
means that there are v control channels and each station k 
has a tuneable transmitter tuned at each of A asiest a 
Also the same tuneable transmitter can tuned at its 
dedicated A, .The outcoming traffic from a station is 


connected to one input of the passive star coupler. The 
NIU of every station uses v fixed tuned receivers one for 
each control channel and one tuneable receiver to any of 


data channel A gi>--sAan - The incoming traffic to a user 
station is splitted into v+1 portions by a 1x (v+1) WOMA 
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splitter as Figure 1 indicates. In our analysis the access 
method to MCA is based on the ALOHA protocol. The 
transmission time of a fixed size control packet is used 
as time unit(control slot) and the data packet transmission 
normalized in control slot time units is L(L>1) which is 
called data slot. The control packet is consisting of the 
transmitter address, the receiver | addres as is shown in 
Figure 1. The access method follows the "tell and go" 
policy which is suitable when the round trip delays are 
much longer than the packet transmission times. By this 
strategy a station wishing to (re)transmit a data packet 
informs the destination about its intention without waiting 
any more. This policy has the advantage of decreasing 
the waiting time before data packet transmission. A 
station will hear the result of the transmission of its control 
packet by listening to the star coupler multichannel system 
since it is operates as a broadcast medium. In the receiving 
mode if a station sees its address announced in a control 
packet, immediately adjust its receiver to the transmission 
wavelength channel(station) which is_ specified in the 
control packet for packet reception. We say that the 
tuneable receiver of a station Z is active if it is tuned 
receiving a data packet from a data _ channel 


Au tk =1,2...N}. If a data packet is addressed to station Z 


and the receiver is active, the packet is rejected. This 
phenomenon is called receiver collision [3],[4]. The 
stations participating in unsuccessful transmissions defer 
their retransmissions for a random time until successful 
retransmission. We assume uniform distribution of 
retransmissions. The random time delay introduced 
between two consecutive retransmissions of backlogged 
stations is uniformly distributed from 1 to K time data 
slots. We also assume that the total offered traffic from 
new generated and retransmitted packets obeys to Poisson 
statistics. 


II. ANALYSIS 
A station desiring to communicate with an other, choose at 
random a control channel A,, at the instant of its 
generation or at the end of the random time chosen for 
retransmission and sends first the control packet and after 
that the corresponding data packet over its own dedicate 


A, data channel. Taking into account that each station 
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choose randomly a control channel independently from the 

others in a complete unsynchronized manner to transmit 

following the unslotted ALOHA protocol, the vulnerable 

time period for the control packets lasts 2 time units. 

Let 

G = the mean offered traffic over v-channel control 
system in atime unit period, that obeys to Poisson 

statistics. 


G,= G/v, the traffic to j,, control channel je {1,2,...v} 
given that each control channel is chosen with equal and 
constant probability p; =1/v. 


Pic = the probability of one Poisson arrival ina control 
channel during a time unit. Then 
Spee 
pte Uva 2G/v (1) 
yv 


A,, = random variable representing the number of 
control packets in atime unit 0< A, <v. 


Thus the probability of finding A, = k control 


channels everyone with one Poisson arrival during a time 
unit, obeys to binomial probability low. 


Pr[A,, = k] = ae (eer e (2) 


S, = the average successful transmission rate of 


control packets in a time unit in steady state. 
Pd) -2G/v 
S.=E{Pr[A, =k]}= 2% kPr[A, =k] =Ge 
k=l 


(3) 
We define P, as the probability of successful 


(re)transmission of a control packet over the control 
multichannel system. From (3) we take 


Pes) Giaie 7°" (4) 
G, = the offered traffic per data slot per station. 
G 
G,=—L (5) 
N 


Sj; = the average successful transmission rate of data 


packets with destination a given station during a data slot 
time. 


ne 


The throughput reduction induced by receiver collision is 
related with the possibility a receiver to be active. Thus we 
examine what will happen if a data packet which starts 
transmission at time instant ¢, and has as destination a 
station Z who belongs to the set 1,2,...M. It is obvious 
that if a data packet has started transmission between 
t, —L and t¢, will have activate the receiver of station 
Z and a receiver collision occurs [3],[4]. In the opposite 
case, the Z station tuneable receiver is idle and the data 


packet will be received correctly. Thus the vulnerable 
period time lasts Z time units. It is evident that any other 
data packet transrhission, starting between ¢, and ¢, +L, 
will find the station's Z receiver active and will be 
aborted, causing receiver collision. In other words the 


collision window interval is extending from f, to 


t,+L. The multichannel nature of the control 
communication system allows more than one successful 
headers transmission from different control channels to 
denote the same destination of data packets during the 
same time unit. Thus the station receiver is tuned only to 
one of incoming transmissions and rejects the others. In this 
case, there are different policies to select which packet is 
received correctly at destination. As an example we can say 
that the data packet that is transmitted on the lowest data 
channel number win the competition. The probability that 
station Z is destination of a data packet is 1/M (for sake 
of simplicity of the analysis we assume that a station may 
sent packets to itself). 

We define the following events for the vulnerable time 
period +, -L <i <t,: 

C ,, the successful transmission of a control packet in 

J», control channel je {1,2,...v}, starting transmission in 
the 7,, —1 time unit of the vulnerable period. 


B,, the data packet starting transmission in the 7, time 
unit, whose corresponding control packet has been 
transmitted over j,, control channel at 7,,—1 time unit 
has as destination a given station, let's say Z . 

Taking into account the independence of the of the C, and 


B 7; events for each time unit, we define A as the event that 


no one data packet has as destination the station Z during 
the vulnerable period. 
IE 13 ees or, 
A=(\(\C,f)8,) ) 
ay 
We define as P_,, 


a data packet from station Z starting transmission at the 


the probability of correctly reception of 
given time instant ¢, . This probability is equivalent with 
the probability P(A). From (7) we get, 

L v 


i=l J= 


ee 
= iS PCB of Cy) PCy) 


fa (8) 
L [eGn, 5 
= tiie ay 
i=l Nv 
[eG 
Se ae ot 
Nv 
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We define as throughput in steady state, the fraction of data 
packets that are received correctly at destination by a station 


during a data slot time. 
Sy = Sa. Pcor (9) 
Using approximations in the above equation, we take 


| Bh Mies 23 G _2GIv 
Sy *—Ge °°" exp[-— Le" (10) 
N N 


lll. DELAY 


Let D =the interval between the time a data packet is 
generated and the moment that has been successfully 
received at destination. Data packet delay D, is composed 
from two parts as follows: 


D=D,_+L+1 (11) 
D, = the delay from packet transmission to beginning of 
the successful reception at destination. 

F_ = the probability a packet successful reception at 
destination in steady state. From Eq. (10) we get 


aie Ge 
F, =e 7°! expl-— Le aba (12) 


Q,(n) = the probability a packet successful 


retransmission after trials. Assuming that the probability 
of success is the same on any try, 1 has a geometric 
distribution, i.e. 


QO,(n) =F (1-F,)"" (13) 
R =the average number of trials for successful ransmission 
of a packet. So 

@ G 
R = E[n] = SnQ_(n) =1/F, =e7°"” exp[—Le °°'"] 

n=l N 

(14) 

D, =(K +1)/2, the average delay between successive 


retransmissions. The average packet delay D, is 
D=ED,]+L+1=(R-1I)XL+14+D,)+L+1= 


etna 


2G/v G 
{e se —]}{L4+14+(K+DL/23}4+L4+1 


(15) 


III. NUMERICAL RESULTS 


If we set the first derivative of Eq.(10) with respect to G 
equal to zero, we find the optimal value of G that maximize 
the throughput. 


16 
yo ty se 

S,(max)=L—e Ve 
2N 


68 


If the above equation is defferentiated with respect to v, 
can be found the optimal of v for fix values of N and L. 


2N 
Vopt ain (17) 


Substituting the above equation into (16) we take 
' 1 
Sy (max) = — (18) 
e 


The above value is independent of L . 

Figure 2 show, the throughput S, versus  G for 
v =1,5,10,20 control channels with Z = 50 time units, and 
N =100 stations. We can observe from the figure that the 
maximum possible throughput per station increases as the 
number of control channels increases until v=10. The 
explanation comes from Eq. (17) which denotes that the 


optimum value corresponds to Von 10 control channels. 
So for G,,, =10, we get S,, (max) = 0.3678. Figure 3 


depicts the average delay E[D], versus the throughput S,, 


for v=1,5,10,20 control channels with JZ =S50time 
units, N=100 stations and K =100 time units. A 


comparison between the curves proves that for v > ee 


we get the best througput-delay behaviour. 
CONCLUSIONS 


The novel concept of the Multichannel Control Architecture 
and the distribution of control information over a number 
of control channels, in conjunction with the fixed tuned 
receivers, minimize the headers processing requirement at 
each station. Each station fixed tuned receiver has only to 
process packet headers at a rate divided by a factor v 
(fixed tuned receivers number) compared with the single 
control channel communication system. In this way a 
network designer can overcome the limit of the maximum 
processing rate of the electronic interface of a station. 

We have presented and analyzed a new protocol 
architecture in which the "tell and go" strategy has been 
adopted for the random access to the communication 
network. The proposed scheme except for the reducing of 
the electronic processing bottleneck, improves the 
performance measures by avoiding data channel collision 
improving any more the efficient use of optical bandwidth. 
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Figure 1: Passive star multiwavelength architecture. 
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Figure 2: The throughput S, per station versus the 


offered trafficG for v=1,5,10,20 control channels, 
L = 50 time units and N = 100 stations/ channels. 


E[D]Ctime units) 


0 0.1 02 03 04 
SN(dato packets/dato slot) 


Figure 3: The average delay£[D], versus the 


throughput S, per station for v =1,5,10,20 control 


channels with L = 50 time units and N =100 stations/ 
channels and K = 100 time units. 
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Minimising the Impact of Internet-bound Traffic 
in the PSTN/ISDN Network 


Dr N. Papadopoulos, $. Charalampous (Cyprus Telecommunications Authority, FTS-Planning) and 
Dr J-P Combot (Network Design&Planning, ALCATEL France) 


Abstract 

The PSTN/ISDN network infrastructure is dimensioned to support 
traffic characteristics which follow a Poisson distribution pattern. 
Internet-bound traffic characterised by long ‘conversation’ times 
deviates from these characteristics resulting in congestion and 
degradation of the grade of service. 


This paper discusses, drawing from the work done in Cyprus 
Telecommunications Authority, (a) the effects of Internet-bound 
traffic on the quality of service, (b) the short term technical 
solution applied and (c) the long term solution envisioned. 


Yndex Terms 
Internet-bound traffic, offloading, grade of service (GoS) 


I. INTRODUCTION 


Cyprus Telecommunications Authority (CYTA) has been 
experiencing, during the last three years the effects of the 
Internet usage explosion. Currently, 5 ISPs provide access 
to around 35.000 Internet service users (representing a 
penetration of 18% of the residential household base), 
through the CYTA PSTN/ISDN network. Presently, all 
ISPs have Points of Presence, PoPs, in all districts in 
Cyprus interfacing with ISDN via Primary Rate Interfaces, 
on ALCATEL switches thereby offering local call charge 
access to the Internet. 


Internet traffic creates a number of problems for the 
PSTN/ISDN network but ultimately the most critical 
problem is that it increases the load on PSTN/ISDN 
resources. As a consequence the costs experienced by 
network operators for a given grade of service, GoS, are 
increased while they result in little or no compensating 
value. 


The Internet-bound call distribution (%) from different 
networks, and average conversation time is given in Table 1 
(data from Call Detail Records representing a sample of 2% 
of calls in 4 weeks -7 days x 24 hours- throughout the 
year): 


Table 1: Internet Calls Characteristics 


The effect of Internet-bound traffic is congestion, its root 
cause being the fact that Internet calls have a much longer 
duration than voice calls (2-5 min) for which the network is 
designed (see Table 1). Congestion is initially observed at 
the point of concentration of the Internet-bound traffic, that 
is, the access point of the Internet Service Providers, ISPs, 
at the Local Telephone Exchanges (LTE) where they 
interface with the Telephone Network. Such a development 
leads to internal congestion at the subscriber stage and 
consequently to low GoS for ordinary subscribers. As the 
Internet-bound traffic volume increases congestion 
problems develop at the origination sites, i.e. the LTEs, and 
the interconnection trunks between LTEs and Regional 
Telephone Exchanges (RTEs) as illustrated in Figure 1. 


Figure 1: Public Network Topology 


PSTN/ISDN 


II. SHORT TERM SOLUTION 


The work undertaken by CYTA (FTS-Planning) aimed at 
providing a fast, economical and sound technical solution to 
cover a short-term perspective that would both satisfy 
demand for PRI interfaces by ISPs and secure acceptable 
grade of service (GoS) for ordinary subscribers. The 
solution was applied at the egress switch which was 
identified as the worst potential point of congestion, and 


was based on the segregation of ordinary and Internet- 
bound traffic, through the use of dedicated Internet traffic 


Call 
% 
8,49 L 
subscriber stages. These would accommodate the heavy 


| 8,49 | 14,47 
aes a 0,018 | 11,87 
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Internet traffic interfaces thus preventing the degradation of 
grade of service to subscribers sharing the same internal 
trunk lines from the subscriber stage to the exchange core. 
Furthermore, the solution provided for load sharing of 
Internet-bound traffic between two LTEs in the case of 
popular ISPs, based on the erlang measure of the traffic 
routed from each originating LTE. Such a configuration 
exploits the two-level hierarchy of the Public network (local 
and regional telephone exchanges; each local exchange 
connects to all regional exchanges in its district) and 
provides not only for better route utilisation but also for 
service continuation guarantees for the ISPs. The topology 
of the PSTN/ISDN -— ISP relation is illustrated in Figure 2. 


Figure 2 : PSTN/ISDN - ISP Connection Topology 


The measured effect of the Internet-bound traffic is evident 
in the comparison of the GoS measurements performed 
before and after the application of the described solution in 
a reference LTE in Nicosia for the provision of service to a 
given ISP (Note that the subscriber capacity of the LTE is 
unchanged). The comparison results are given in Table 2. 


Table 2: GoS for a reference LTE 
GoS Parameter 


Completed Calls * 


Busy Calls * 
Not Answered Calls * 17,30 


Unsuccessful Call Attempts due 0,75 | 2,30 


to Network Failure * 

Calls Reaching Destination * 
Ordinary Subscriber Load 407,7 
Internet termination Load 
Erlang/Ord. Subscriber 
Erlang/Internet Access Port ** [0,00 | 0,749 | 0,968 | 
Average Origin. Traffic (Er!) 
Average terminating traffic (Erl) 
Average Call Holding Time | 88,9 98,7 
Originating (sec) 


Average Call Holding Time | 115,6 | 271,1 | 335,3 
Terminating (sec 


% Internal Traffic 


* -: ratio with respect to Processed Calls 
** - estimated 


It is apparent from the results that the major impact of the 
Internet usage is a dramatic increase in the volume of 
terminating traffic (equivalent to an exchange with double 
the subscribers for the example at hand). This should also 
be associated to the higher percentage of the ‘Unsuccessful 
calls due to network failure’, and the dramatic difference 
between the traffic attributed to ordinary subscribers and 
the Internet access ports. Another consequence is an 
increase in the % of busy calls and the % of internal traffic 
due to increased Internet related traffic. Furthermore, a 
change in the exchange Busy Hour was noted: from 18:30- 
19:30 to 19:00-20:00. 


The changes in the traffic processed by the LTE in question 
are illustrated in Figures 3 and 4, produced using 
ALCATEL’s traffic measurement tool SCOOP (Switching 
Centre Operation Observation Processing): 


Figure 3 : Traffic Processed by Reference LTE : Before 
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Figure 4 : Traffic Processed by Reference LTE : After 
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Ill. CURRENT WORK 


The continuing increase of the Internet-bound traffic called 
for a solution that will enable offloading at a point as near 
as possible to the ingress (Internet user) without imposing 
any unnecessary financial burdens to the network. Given 
the current network hierarchy (as illustrated in Figure 1), 
work began in search of solutions that would effect Internet 
traffic offloading at the RTEs which were identified as the 
second worst point of potential Internet traffic congestion. 


Research revealed that the available solutions are based on 
two philosophies: 


The first, made possible by the latest ALCATEL operation 
system software, supports PRI interfaces to the core of the 
exchange. Hence, adoption of this philosophy would entail 
connection of the ISPs to the core of the RTEs using the 
same interface as used now (PRIs). The result would be a 
reduction of potential congestion problems on the routes 
between the RTE and the terminating LTE where the ISP 
interfaces the public network. In addition, any potential 
congestion problems at the interfacing exchange affecting 
the GoS of ordinary subscribers are eliminated. 


The second, which presents the same advantages as the 
first, eliminates the necessity of PRI interfaces between the 
public network and the ISP using instead peer-to-peer PCM 
connections for the Internet traffic offloading. This solution 
is dlustrated in Figure 5. 


FigureS: Internet Traffic Offloading via PCM 
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IV. FUTURE WORK 


The advent of next generation networks and the apparent 
evolution of circuit-switched networks to multiservice 
packet switched networks, paves the way for CYTA to look 
for new ways to accommodate the increasing Internet- 
bound traffic volumes. These might include use of the ATM 


SLT: Signaling Link Terminal 


or a future IP backbone network. At the same time work is 
being carried out on the use of the ADSL technology in 
order to alleviate potential congestion problems emanating 
from the increasing penetration of the Internet usage and the 
subsequent use of the Internet-bound traffic. 
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Performance of Common Channel Access in 
UTRA 


Margarita Ruiz-Garcia, C. Tellez-Labao, J. M. Romero Jerez, J. E. Burgos-Lati, A. Diaz Estrella 


Abstract— This paper deals with the study of the mechanism of 
medium access in the reverse link within a third generation 
UTRA system. The voice service is beamed off through a 
dedicated channel whereas a common channel is used for a 
very widespread service nowadays like the WEB traffic. The 
results reveal that there are great differences between the only 
voice system and the only data system. We can conclude that 
the maximum number of users within the system will depend 
only on the required quality of service in the case of the voice 
system (admission control), but in the case of the data system 
it depends on the available resources in the access channel. 


Index Terms—UTRA, access channel, available resources 


I. INTRODUCTION 


The explosive growth of the Internet and the continued 
increase in the demand for all types of wireless services 
(voice and data) are fuelling the demand for large increases 
in the capacity and data rates in the next generation wireless 
communication systems. By means of the successful 
deployment of IS-95 cellular systems, the CDMA technique 
has been recognised as one of the best candidates to achieve 
that goal [1]. This work focuses on the European UTRA 
standard (UMTS Terrestrial Radio Access) [2] in order to 
study the dilemma of the services integration as far as 
access and transmission packet schemes is concerned: voice 
packet transmission takes place on a dedicated channel and 
data packet transmission on a common channel. The choice 
of these transmission schemes is based on the traffic 
characteristics of both services (real time) and on the 
amount of information to transmit (frequency of the 
packets). The aim of our study is to analyse the 
performances of these transmission schemes so as to get 
QoS guarantees according to the uplink available resources 
of the common channel. 


II. COMMON AND DEDICATED CHANNELS IN UTRA 


The UTRA system dealt in this paper has a common uplink 
channel called RACH (Random Access Channel), where 
the users can send their access requests, and uplink 
dedicated channels called DPDCH (Dedicated Physical 
Data Channel) and DPCCH (Dedicated Physical Control 
Channel) [2]. The dedicated channel DPDCH is used to 
carry dedicated data generated at layer 2 and above. The 
dedicated channel DPCCH is used to carry control 
information generated at layer 1(figure 1). The frame has a 


Departament of Electronic Technology in the University of 
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length of 10 ms and is splitted into 16 slots. RACH 
common channel is based on a slotted ALOHA. A mobile 
station can start the transmission at a number of well- 
defined time-offsets, 8 time slots, relative to the frame 
boundary of the received downlink BCCH (Broadcast 
Control Channel) of the current cell. The random access 
burst consists of two parts, a preamble part of length 1 ms 
and a message part of length 10 ms (figure 2). The 
preamble part consists of a signature of length 16 complex 
symbols. There are a total of 16 different signatures. The 
message part has the same structure as the uplink dedicated 
physical channel. 


The dedicated channel provides a more reliable way of 
transmission than the common channel since a channel is 
exclusively reserved for a terminal and collisions are 
avoided. Besides, fewer mistakes will take place in the 
dedicated channel than in the common channel thanks to the 
use of closed loop power control. This is an important issue 
when it comes to considering transmissions in common and 
dedicated channels. 


Ill. TRAFFIC MODELS 


In this work we have studied two services: a voice service 
and a data web service whose models will be described in 
this section. An ON-OFF scheme with exponential 
distributions of the ON and OFF periods models voice 
traffic (figure 3). One interesting parameter is the activity 
factor o like a measure of the medium rate of the resources 
occupied by the service. The values used for the parameters 
of the models are depicted in table 1. 
Session ON 


i Session + Idle ON + OFF 


Figure 1: Frame structure for uplink DPDCH/DPCCH 


Random access burst 
Preamble 


1 ms 0.25 ms 10 ms 
Figure 2: Structure of random access burst in uplink RACH 
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Figure 3: Scheme ON-OFF for voice traffic 


iu ~ Mean values of ) of exponential distributions — 


“See ‘ON | OFF Session — Idle O. 
Voice [1s 1.358 [1808 [9008 [0.0709 _ 


Table 1: Traffic parameters Tor voice service, 


The data web traffic model has been developed by the 
Department of Electronic Technology in the University of 
Malaga, it is based on real traces and its main feature is its 
high variability [3], In the case of Internet and Intranet 
traffic, distributions of the ON and OFF periods are not 
exponential, the pareto distribution is used for these periods 

(infinite variance), Figure 4 shows the model being used 

where three different periods appear; 

1. First period: Requests, The user requests the web 
pages, The number of request packets can be estimated 
like the division between the size of request and the 
size of the RACH packet, The size of request is equal 
to the number of requests (pareto distribution) 
multiplied by the size of each request (lognormal 
distribution), 

2, Second period: Acknowledges, The user sends the 
ACKs corresponding to the received downlink pages, 
The duration of this period and the number of ACKs 
have a pareto distribution, The time between the ACKs 
has an exponential distribution, 

3, Third period: Reading time, The user is inactive, The 
duration of this period has a gamma distribution, 

The values of the statistical parameters belonging to the 

distributions corresponding to these three periods are shown 

in table 2 [4], 


| aL 


a SS IE <eRE TET 4 
Porlod 1 Porlod 2 Period 3 


Figure 4; Scheme for data web traffic 


Distributions for each period 
| Number of requests: 
Pareto; a=2,22, h=6.42 
Size of requests: 
Lognormal; mean ©352 bytes, desviation = 109,85 
bytes 
2 Duration of the period: 
Pareto a 1.1, mean 35,703 s 
Time between ACKs: 
Exponential mean*0.73 seg 
Number of ACKs: 
Divison; size of the page / size of the packet, 
Size of the page: Pareto w= 1,75, mean=40.6 Kbytes 
Size of the packet: 800 bytes 
3 Duration of the period: 
Gamma, mean@35,286 s, desviation=147,39 s 
Table 2; Traffic parameters for data service, 


IV, SYSTEM MODEL 


The following assumptions have been under consideration 

in the simulations: 

~The delay of the ACK/NACK has been traced to be a 
frame (10ms), that is, the access requests in frame i are 
acknowledge in frame i+], 

~The voice call requests that may collide or loose 
themselves are retransmitted in the following frame after 
the downlink acknowledgement frame without limiting 
the maximum number of new attempts, The aim is to 
determine the number of attempts from the simulation, 

~The maximum number of new attempts for data packets 
transmission is not limited, A stop and wait scheme in 
common channel for these packets has been used, that is, 
the user waits for the packet to be acknowledged before 
sending the next packet, 

~No backoff algorithm for collisions has been 
implemented, 

~The available resources, the timing offsets and the 
signatures and the losses channel have been modified in 
the RACH channel in order to draw their relation to the 
QoS parameters (delay, retransmissions and collisions), 


The simulations in this paper have been made by using the 
simulation tool OPNET, The parameters used in our 
simulations are depicted in table 3, 


Parameter , Value 

Frame duration 10 ms 

RACH timing offset per frame 2, 4, 6, 8 
Number of RACH signatures 1,4, 8, 16 
Common channel ACK delay 10 ms 

Common channel loss 0%, 5%, 20% ~ 
Dedicated channel logs 0 


DPCCH to DPDCH switching delay 10 ms 
Bit rate in RACH 128 Kbps 
Bit rate in dedicated channel 32 Kbps 


Table 3; Simulation parameters 


V. RESULTS 


In table 4 the maximum capacities for only voice and data 
systems are displayed according to the process gain and the 
required quality in each system [5], The aim of offering 
these data is to compare the number of users in the 
simulated stages with their capacities to study the viability 
of the access techniques proposed, 


System Parameters Capacity 

Only voice service SIR 2,G= 128 491 
SIR=1,G = 128 1000 
SIR = 2,G = 32 685 
SIR#5,G=32 269 


Table 4; Maximum capacity in different systems, G is the process gain and 


SIR is the signal to interference ratio, 


Only data service 


e Only voice system 
The way of dedicated channel transmission chosen for the 
voice means a low use of the common channel because the 
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users only access to it at the beginning of the call to send 
the access request, For this reason we can infer from figure 
5 that two time offsets or slots in RACH are enough for 
1000 voice users (access delay lower than 24 slots, that is, 
30 ma), This amount of users is roughly twice the capacity 
of the system (SIR@3 dB) from which we can conclude that 
the collisions do not limit the system, This appears in figure 
6 where we can see an exponential decrease of the number 
of collisions with the increase of the available slots, 
Besides, we have to direct our attention to three aspects: 1) 
The higher the slope curve is the higher the number of users \ 
is, 2) The decrease in the collision is greater when we move 
from 2 to 4 slots but later on the evolution is slower, 3) 
None of the curves exceed the 1% collision, a very low 
threshold, In figures 7 and 8 there is a minimum difference 
in the retransmissions distribution function for 500 and 
1000 users being the decrease slower the more the number 
of slots increase, In this way we can see that going from 2 
to 4 glots for 1000 users (figure 7) the retransmissions 
decrease from 2 to | approximately and going over 4 slots 
the retransmissions are 0, The same thing happens for 500 
users (figure %), In this case two access slots would be 
enough to limit the retransmissions to 2 being the number 
of users greater than the system capacity, 


¢ Only data system 

The first simulations in this system revealed that, due to the 
common channel transmission, a minimum resources 
threshold is necessary to avoid the congestion (99% of 
collisions), This minimum threshold (8 access slots and one 
signature for 300 data users) is achieved in all the following 
results, In figure 9 it is shown that 4 signatures or sequences 
for 300 users and 16 for 100 users gives birth to very little 
differences in the transmission times for page requests, As 
far as the retransmissions produced by collisions is 
concerned, in figures 10 and 11 we can notice that by 
increasing the number of sequences from | to 4 there are 
differences in the distribution function, On the other hand 
by increasing till 16 sequences there are very little 
differences, Besides, the differences in the number of 
attempts for 100 and 300 users become bigger being smaller 
the number of sequences, This results tell us that 4 
sequences are enough for 300 users (300 users are 
equivalent to the maximum system capacity for a SIR@7 dB 
as we can see in table 4) because the QoS parameters are 
not improved when the number of sequences reach 16, This 
same thing is visible too in figure 12, where the collisions 
decrease 4 lot with a sequence growth from | to 8 but they 
do not alter too much when the growth is till 16, Besides, 
the differences for 100 and 300 users are very high for | 
sequence (11% for 300 users against 3% for 100 users) and 
they slow down exponentially when the number of 
resources is bigger, In figure 13 there is also a great 
difference in the power histograms for | and 4 sequences 
and a great similarity in the ones related to 4 and 16 
sequences, In the histogram of | sequence going from level 
(0) to 5, the 0 level is the most recurring one whereas in the 
histograms of 4 and 16 sequences going from 0 to 10, the 3 
level has the highest probability, These results show us that, 
due to the characteristics of data traffic, the most common 


thing is to find a maximum of 3 active users in the RACH, 
For this reason the threshold of 4 sequences is enough to 
offer quality guarantees, 


VI, CONCLUSIONS 


In this paper, we have dealt with the necessary resources in 
the common channel RACH in an uplink UTRA system in 
order to offer guaranteed services. The study of the only 
voice system reveals a low spending of the common 
channel resources for a number of users superior to the 
system capacity, This first result ratifies our choice of the 
common channel for the data transmission because the free 
resources can be used in this way, The simulation of the 
only data system, being the number of users close to the 
system capacity, resulted in our concern about the existence 
of a minimum threshold of resources necessary to avoid the 
congestion, Nevertheless, once we have completed the 
threshold it is not necessary to make use of all the available 
resources, which makes possible the voice integration, 
Following our analysis, we come to the conclusion that the 
use of this two access strategies is possible for voice and 
data web integration, 
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Figure 7: Voice system. Retransmissions CDF changing 
the number of available slots for 1000 users of voice 
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Figure 8: Voice system. Retransmissions. CDF changing 
the number of available slots for 500 users of voice 
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Figure 9: Data system. Time transmission CDF of the web 
asks for 100 users and 16 sequences and 300 users and 4 
sequences. 
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Figure 10: Only data system. Data retransmission CDF 
for 100 users and 1, 4, 8, 16 sequences. 
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Abstract— It is well known that the best results regarding 
concurrent system design are obtained when design errors 
are find in the earliest possible phase. For that purpose 
system specification is verified through model checking. We 
try to hidden as much as possible model checking formalism 
from designers viewpoint. 

First, a system is modeled as set of processes described 
formally as eztended finite state machine within SDL~~ lan- 
guage. Such description is translated into model checker 
SPIN where desired properties are verified. 

Special attention is given to the possibility of modeling vari- 
ous types of transitions and to a definition of the tool where 


model checking is performed. With such approach designer. 


can have SDL~~ specification verified against desired prop- 
erties. 


Keywords— model checking, verification of specification, 
SDL, SPIN, concurrency, protocol synthesis 


I. INTRODUCTION 


HIS paper presents an utilisation of the reachability 

based approach for model checking of concurrent sys- 
tems. Verification of the specification is performed through 
model checking. Differences between concepts introduced 
in model checking and formal description techniques are 
unified through common modelling environment. We re- 
strict our attention on concurrent systems described for- 
mally with the finite automaton class of models and reach- 
ability based model checker. With such approach designer 
can perform model checking without knowing in depth how 
model checker works. 
Model checking is focused on proving whether system sat- 
isfied desired properties like deadlock absence, unspecified 
reception, unexecutable transition or liveness. Formal de- 
scription techniques are focused on how to describe desired 
system in consistent and rigid way. System descriptions are 
usually too complex for manual analysis. Model checking 
and specification are supported with different languages. 
Each of them has constructs tailored to meet only model 
checking or specification needs. To connect specification 
and model checking, we introduce a translation of the spec- 
ification language into model checking language. 
Such approach is suitable for research environment for ex- 
perimenting with concurrent and coordination languages, 
and verification of specification. 
After introduction, model checking for verification of the 
specification is introduced. Previous work with specifica- 
tion and verification languages descriptions serves as back- 
ground for introducing SDL~~ language. After that the 
tool for model checking is introduced. At the end related 


work is presented and compared with our approach, to- 
gether with an example, conclusions and directions for fur- 
ther study. 


II. MODEL CHECKING FOR VERIFICATION OF 
SPECIFICATION 


One of the goals during complex software system design 
is to prove whether target system satisfies desired set of 
properties. Such kind of proofs can be obtained through 
actions taken under methodologies called automatic theo- 
rem proving and model checking. 

The best results are obtained when system is checked as 
early as possible during development. Roughly speaking, 
system design consists of requirement, specification and im- 
plementation phase. Each of these phases can be tested 
against desired properties. Such process is known as system 
validation. Thus we have a verification of the requirements, 
verification of specification as well as testing as verification 
of the implementation. 

To obtain the best possible results a system must be de- 
scribed in formal, mathematically rigid way through formal 
description technique (FDT) [1]. After that system can be 
model checked to prove whether it satisfy desired set of 
properties. 

We will not analyse the differences between model check- 
ing for each of the phases of system design. Rather on, 
the focus is on building specification model which fits more 
naturally into the model checking environments. We will 
refer to each phase as (formal) specification making no con- 
ceptual differences between requirements, specification or 
implementation. Semantical differences between specifica- 
tion models can be solved with different modelling within 
the same model checker. 

Special attention will be paid into connecting together the 
best suitable environments into the coherent and consistent 
problem solver. 


A. Previous work 


First ideas for such approach comes from signalling pro- 
tocol implementation in SPC switching system. Later on, a 
methodology for intelligent network protocol synthesis was 
developed [2]. In order to generalize such approach into 
the framework for concurrent system design four cases of 
protocol synthesis were defined: 

1. synthesis from service: develop the protocol from semi- 
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3. resinthesis: protocol is described narrative or semi- 
formal, describe the protocol formaly and model check it 
4. verification: protocol is formally described, model check 
is performed 

Resinthesis and verification are natural candidate for the 
extension of the methodology. The experience shows that 
the most of the problems were experienced in modelling 
phase. In other words, the problem was how to connect 
our representation with model checker’s representation. 
The first experiment [3] was direct translation: SDL di- 
alect found in ITU-T recommendation was translated into 
the PROMELA language. The differences in semantic be- 
tween SDL and PROMELA languages resulted with ‘the 
three level model [4]: 

Abstract level introduces all necessary sets, functions and 
theorems necessary for the analysis of the problem. 

Each problem is decomposed into the set of smaller units 
called ECP (Elementary Communicating Processes). In 
our case each SDL~~ is an ECP. ECP are connected 
together into the net. 

Operational level operationalize different semantics into 
the consistent one. 

Implementation level handles the programe modules. 


B. Specification and verification languages 


As we mention earlier, the specification model will in- 
corporate different specification and verification environ- 
ments. That means that a problem is specified in a given 
FDT while the model checker’s input is different FDT. As 
a solution, specification FDT language is translated into 
the model checker’s language. Thus we obtain a pair of 
languages: specification language with model checking (or 
verification) language. 

Theoreticaly speaking, each of numerous FDT languages 
can be in such way connected with verification language. 
In practical solutions, only smaller subset is in use. The 
intention is to give the framework for heterogenous lan- 
guage pairs interworking. SDL~~ to PROMELA is such 
pair used in this paper. Besides PROMELA [5] and SDL 
[6], there are other languages like PROD [7] and SMV [8]. 
Their possible interractions are shown in figure 1. More 
languages are analyzed in Appendix B of [1]. 

We do not have intention to find the most suitable pair as 
only solution for some class of problems. Rather on, frame- 
work will be defined to establish coordination between dif- 
ferent language pairs. 


B.1 SDL~~ language subset 


SDL~~ language subset is introduced within the follow- 
ing constraints: 
- language is based on extended finite state machine (fsm), 
- transition semantic is interrsection of SDL and 
PROMELA semantic 
- SDL~~ must be expressed and translated into ITU-T 
program SDL 
- SDL~~ and PROMELA must interract with other lan- 


guages: model must support the definition of ECP with 
different languages 


Concrrent 


languages yf ed 


finite state PROD 
machines 
high level PROMELA 
Petri nets 

SMV 
SDL 
Estelle 


Fig. 1. specification and verification languages 

SDL~~ language transition semantic is shown on fig- 
ure 2, input(z,...) denotes input information flow with 
data fields while output(...) denotes output data flow. 
Mealy and Moore finite state machine as well as transition 
with JF statements are introduced. 
Ideal solution will be to express all semantic construct from 
SDL language semantic specification given in ITU-T [6] rec- 
ommendations. Rather on, from model checking point of 
view a, 3 and y kind of transitions are introduced. 
On figure 2 a fsm has a0, al...a4 states with a transitions, 
2 and y¥ transitions have b0...62 and cO...c2 fsm respec- 
tively. The motivation for different transitions follows the 
neccessity to model check concurrent behaviour close to im- 
plementation (y transition). Concurrent behaviour regard- 
ing buffer capacities is model checked with a transitions 
while @ transitions are usefull in requirement phase. 
Transition usage is fully problem dependable. It is in- 
tended to establish framework capable of supporting con- 
strucs like while or for and to model programming lan- 
guages. 
Another important theoretical issue are communicating al- 
gebras like CCS [9] and CSP [10]. PROMELA language is 
establishe on Dijkstra’s guarded commands and CSP. The 
influence of CCS can be explained if try to model check 
a and ( fsm from figure 2. In general sense they are not 
equivalent. But in some practical situations (when we look 
if all information flows are present) they are equivalent. 
Even the more, two peer entities which are exact copies of 
a fsm, one on sending side, the other on receiving side are 
hardly able to communicaty if they are asynchronly con- 
nected. Model checking can help to analyze and solve such 
kind of problems. Syntax of language is in table I. 

The set of keywords is intentionaly redundant to allow 


TABLE I 
SDL7~~ LANGUAGE SYNTAX SUMMARY 


data fields 


<state_no> <state_code> 
<state_description> <process> 


STATE 


ENDSTATE 

NEXTSTATE 

INPUT 

OUTPUT 

.DIV, CASE, ENDCASE , SKIP 


<state_code> 
<state_code> <process> 
<message> <process> 
<message> <process> 


— FS 
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input(il,i2,i3) 


output] (03,04) 


output2(05,06) output3(07,08) 


input(il,i2,i3) 
output! (03,04);output2(05,06) 


input(il,i2,i3) 
output I (05,06);output2(07,08) 


input(il ,i2,i3) 
output] (03,04) 


IF 
: output2(05,06) : output2(07,08) 


Fig. 2. a, B, y SDL~~ transition semantic 


strightforward parsing: 

STATE : place with State Description, State Number, State Code 
and Process names 

ENDSTATE : end of the state 

NEXTSTATE : nextstate 

INPUT : input message 

OUTPUT : output message 

DIV : divergence to new substructure. Each substructure is marked 
with keywords CASE and ENDCASE containing other substructures, 
inputs, outputs 

CASE : substructure inside DIV: begining of substructure 
ENDCASE : substructure inside DIV: end of substructure 

SKIP : go to next command (hide non relevant fragments) 


III. THE TOOL 


The tool outline is introduced in figure 3 as a graph where 
boxes are program modules and circles test whether all data 
for subsequent program module are present. (CO)4*PE 
framework generates automaticaly such graphs. As an op- 
tion, graph from figure 3 is analyzed as high level Petri net 
providing model checking for framework. Another option 
is to build the graphic user interface with the same out- 
line as on figure 3. Clicking on boxes invokes desired pro- 
gram modules. Boxes ’knows’ whether all necessary data 
is present to succesfully invoke desired module. 

Core of the framework is performed in translate (trans) 
and verify (verify) program module. The purposes of other 
modules are listed in table II. 


TABLE II 
(CO)4PE FRAMEWORK PROGRAM MODULES SUMMARY 


module | description 


editor | general purpose editors like emacs 
or vt 
parser | parsing of a, @ and y SDL-~ 


modules; LEX-YACC perl script 
check if each transition satisfy 
semantic; implemented as perl script 


seman.check 


translate | SDL~~ translated into PROMELA and 
pan./bchmt] built; verifier SDL~~ 
to SDLpr 
rules | transitions are checked against 
design rules; 
verify | run pan 
Xspin | run spin GUI as an option 


interp.res | interprete results; go for next 


iteration if necessary 


IV. AN EXAMPLE 


Communication protocols are important part of dis- 
tributed systems. Signalling protocols are special class of 
communicating protocols supervising information flows be- 
tween switching nodes. Signalling system No. 7 (Q.7xx 
series recommendations) [11] together with interworking 
protocols (Q.6xx series recommendations) were starting 
point for our analyzis. The description of such protocols is 
given in narrative recommendations with some SDL dia- 
grams. The goal is to verify protocol and to find deadlocks, 
unspecified receptions, unexecutable transitions and other 
protocol errors. As an example ISUP protocol introduced 
in Q.6xx interworking recommendation is analysed. First 
proper SDL description is build. As transition semantics 
follows a transition semantic messages are redesigned. 
For example, output(ol,o2) becomes output,1, output,2. 
In the following example IAM message is expanding accord- 
ing to continuity check (CCH) and ‘end-of-pulsing’ data 
fields: 


STATE 01 01 Wait_call_setup_info 0; 

DIV; CASE; 

OUTPUT !IAM_CCHi_STNO_NOSTARTDA I; 
NEXTSTATE 02 0; ENDCASE; CASE; 

OUTPUT !IAM_CCHi_STNO_STARTDA I; 
NEXTSTATE 02 O ENDCASE CASE; 

QUTPUT !IAM_CCH1_STYES I; 

NEXTSTATE 02 0; ENDCASE; CASE; 

OUTPUT !IAM_O2COCY12FYES_STNO_NOSTARTDA I; 
NEXTSTATE Ki; ENDCASE CASE; 

OUTPUT !IAM_O2COCY12FYES_STNO_STARTDA I; 
NEXTSTATE Ki; ENDCASE; CASE; 


producing the follwing PROMELA code: 


proctype I() { 

s00: do 
::atomic{OI!IAM_CCH1_STNO_NOSTARTDA I;goto s01;}; 
::atomic{ 

OI!IAM_CCH1_STYES I; goto s01; }; 
::atomic{OI!IAM_CCH1_STNO_STARTDA I; goto s01;}; 
t:atomic{OI!IAM_CCH1_STYES I; goto s01;}; 
::atomic{OI!IAM_O2COCY12FYES_STNO_NOSTARTDA I; 

goto K1;}; 
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Fig. 3. SDL~~ model checker framework 


V. RELATED WORK 


A framework or meta model for the composition of 
processes is described in [12]. Interractions are events 
described as value-tag pairs. CSP, Kahn process net- 
works and Petri nets are models of computations described. 
Model can be transformed into “...a quit different model” 
by tag manipulaions. Meta model described here can serve 
as abstract model for (CO)*PE framework. 

Another approach is to use commercialy avaliable SDL 
tool. The same results can be obtained regarding model 
checking. But if we want to experiment with different spec- 
ification - model checking language pairs or to experiment 
with different transition semantics or algebraic structures 
our approach is more feasiable. 

There are other approaches of translation of SDL into the 
PROMELA in [13] and [14]. 

The PEP tool framework can model check SDL specifi- 
cation [15] with Petri net. Within the same team utility 
mckit has been developed to mix different language pairs. 


VI. CONCLUSIONS AND FURTHER STUDY 


In short, our results can be summarized as: *describe a 
system as set of ertended fsm through SDL syntax, then 
verifiy through SPIN model checker’. Different transition 
types (semantics) as well as connection with program SDL 
allows design flexibility. 

Although the idea of translating SDL into PROMELA is 
not new we introduce the connection between specification 
and verification languages through (CO)*PE framework 
allowing different transition semantics. The tool outline in- 
troduced serves as ’preprocesor’ for model checking. Such 
approach yields the possibility for experiments with mixed 
paradigms as well as heterogenous components composi- 
tion as well as to hidden model checking formalism from 
specification process. 

Future study will try to produce models for model check 
directly from problem description or concurrent software. 
Another research direction is synthesis of concurrent sys- 
tem from heterogenous components. (CO)* PE framework 
must ensure flexiblity for experimets regarding concurrent 
process composition. 
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Automatic translation from SDL specification 
to distributed Erlang implementation 


Marijan Kunéti¢, Oliver Juki¢, Miljenko Mikuc 


Abstract— This paper is focused on telecommunications 
system software specification and its implementation. 
Specification and Description Language (SDL) is widely used 
in concurrent system specification. System implementation can 
be done independent of specification. This paper presents SDL 
to Erlang translator that was developed in order to improve 
translation from formal system specification toward its 
implementation. Since Erlang supports distributed system 
implementation, we have extended SDL language to support 
distributed specification. Extension is done within SDL 
comments, without disturbing SDL language rules. However, 
translator is able to read those “hints” for system distribution, 
and to generate distributed Erlang code. 


Index Terms— software specification, telecommunication 
systems implementation, Formal Description Techniques, 
SDL, Erlang, translation. 


I. INTRODUCTION 


Computer software is becoming the dominant part in the 
development process of telecommunication systems. 
Formal Description Techniques (FDTs) represent a 
promising approach for obtaining confidence in the 
correctness of system specification, covering its hardware 
and software components. It is allowed to choose any 
programming language to implement system according to 
its specification, even to choose which system parts will be 
realized by hardware or by software modules. Specification 
and Description Language (SDL) is widely used in 
specification of concurrent systems, especially 
telecommunication systems. Erlang is programming 
language developed in Ericsson Computer Science 
Laboratory aimed for telecommunication software 
development. It is based on Communicating Finite State 
Machine model, which ensures consistent implementation 
of SDL specification. 


This paper is structured as follows: we will first give a brief 
introduction to SDL and Erlang programming language. 
Since relations between SDL and Erlang are considered in 
previous works, we will not focus on it. In the second 
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section we will describe development of our SDL to Erlang 
translator. Finally, we will present distributed features and 
code generation. 


Il. SDL 


SDL (Specification and Description Language) is a standard 
language to specify and describe systems. It is widely used 
in telecommunications area as well as other areas that 
consider concurrent based systems. SDL was developed by 
CCITT (Comité Consultatif International Télégraphique et 
Téléphonique), now ITU-T. There are two ways of 
specification: graphical and textual (SDL/GR and SDL/PR). 
Graphical representation makes SDL very easy to use, since 
human perception is primary visual The most popular part 
of the language is process diagram. However, complete 
specification covers structure of the whole system, 
including process behaviors. 

Specification tends to view system as a black box, where 
the behavior of the system is seen as a whole. Even if the 
internal parts are specified, an implementation may choose 
some other internal structure as long as the overall behavior 
at the higher level remains the same. 

SO, SDL sees the world as divided into two parts: the 
system and its environment. Only the system is under 
consideration while writing SDL specification. Some 
assumptions on the environment exist. Since system 
communicates with the environment by signals that are 
specified previously, assumption is that environment 
behaves reasonable; only previously specified signals can 
stimulate the system. Signals that has not specified are 
consumed but without any influence on the system (if 
environment wants to stimulate the system, it has to know 
how to do it). This ensures complete protocol specification 
for “environment-system” communication. This protocol 
can be specified using some other languages, e.g. MSC 
(Message Sequence Chart). In combination with SDL, it 
ensures powerful so-called “language chain” to specify 
system. 

As it mentioned above, system specification delineates the 
part of the world that is to be specified. The final result of 
all activities related to specification should be to specify 
overall behavior of system. But, this is not done at the 
system level only. Instead, the system behavior is defined 
by the behavior of its constituent blocks and the way they 
communicate with each other and with the environment. At 
the lower level, the block behavior is defined by the 
behavior of the processes that are contained within the 
block and way they communicate with each other and with 
edges of the block. The lowest level is process level, which 
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defines particular process behavior in terms of process 
graph, which is the most known part of the SDL. The 
system level defines static structure of the system, the 
process level defines dynamic behavior of the system, and 
the block level defines both static structure and dynamic 
behavior of the system. 

Within the systems, communications channels connect 
blocks with each other as well as with the environment. 
Within the block communication routes connect processes. 
Every communication route, which connects process with 
the edge of the block, has to be connected with one or more 
communication channels. We have to specify signal list for 
every particular channel or route. In case of connection 
between channel and route, both signal lists have to be the 
same; otherwise some signals can be lost. 


Ill. ERLANG 


The Erlang programming language has grown as a result of 
a series of experiments aimed to create language suitable 
for programming telephony applications. It is a concurrent 
declarative language with real-time support. There are 
several main characteristics of the Erlang: 
e Functional notation 
e Explicit concurrency 
e Asynchronous message passing between processes 

(very important) 
e Pattern matching 
e Distribution 
e Error handling 
e Module support 
Switching systems require all these characteristics. 
Process and communication between processes are basic 
Erlang concepts. The way of communication is the same as 
the SDL way of process communication. 
Inter-process communication is possible only by message 
passing. We need to know Pid (Process Identifier) of the 
process to which we are sending a message. If we know the 
Pid, we can send the message to the process. 

So, we may conclude that there is a possibility of easy 
conversion of the SDL specification (either SDL/GR or 
SDL/PR) into the Erlang language. 


IV. SDL TO ERLANG TRANSLATOR 


Since relations between SDL and Erlang are considered in 

previous works ([2], [4], [5], [7]), we will try to give an 

overview of our work with SDL to Erlang translator, 

developed at our Department. Some assumptions have 

existed before translator development: 

e Input (SDL specification) is syntactically correct. 

¢ SDL specification is semantically correct; SDL rules 
are not disturbed. 

Our translator doesn’t perform these analyses during code 

generation code, since commercial SDL tool (Telelogic 

SDT) is used for specification. 


Structure of translator is shown at figure 1. 


Translator 


SDL/PR 
specification 


Lexical 
analysis 


token stream 


Error 
handling 


Specification 


4 Messages 
analysis ape 


Data about 
system 


Code generation 


Erlang 
code 


Fig.1 - Structure of translator 


Lexical analysis module performs lexical analysis of SDL 
specification. Lexical analyzer is generated by generator of 
lexical analyzers lex, which is standard part of the Unix 
operating system. 

It reads SDL/GR specification and produces sequence of 
tokens that the specification analysis module uses to 
analyze specification. 

Specification analysis module uses token stream as input. 
It collects all available data about SDL system before code 
is generated. Module behaves as finite state machine. It is 
stable in a state and it waits for the trigger in terms of 
appropriate token. One small part of automata describing 
this module is shown at figure 2. 

Error handling module handles errors. Since specification 
is syntactically and semantically correct, errors can occur 
when translator does not support some parts of SDL 
specification. If possible, those parts will be ignored. If not, 
Erlang code will not be produced. 
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Fig.2 - Part of specification analysis module 


Code generation module generates Erlang code. It uses 

collected data about specification as well as some kind of 

intermediate code. Two important intermediate code’s 

features are required: 

e Intermediate code has to be generated in a simple 
manner from input. 

e¢ Output code has to be generated from intermediate 
code with minimal efforts. 

Since we’re working with languages based on the same 

principles (communicating finite state machine) above 

mentioned requirements are satisfied by input, SDL 

specification. 


VY. DISTRIBUTED FEATURES 


Specification and implementation are expected to be 
independent. During specification phase, it is not necessary 
to know which parts of system will be implemented in 
distributed manner. 

Since Erlang supports distribution, it is desirable to have 
some mechanism that will emphasize distributed 
characteristics during specification of the systems that will 
be implemented using Erlang. Since SDL doesn’t support 


this feature, we have decided to include some information 

about system’s distribution within SDL comments. Several 

papers exist that presented this idea before (see [10]). 

Included information is very poor, but ensure unambiguous 

distributed implementation of Erlang system. Those are 

stored within compiler’s reserved words deSDL and 

deSDLend (distributed SDL and distributed SDL end). 

This ensures that compiler will read this information 

correctly, and will produce distributed Erlang code with 

desired architecture (Fig.3). 

This mechanism provides only limited information: 

e Is the system distributed or not (default is “not 
distributed”). 

e On which Erlang nodes parts of the system will be 
implemented, if system is distributed. 

Erlang allows distribution of processes on different Erlang 

nodes. 


System level 
ike 
deSDL 
distribuded 


deSDLend 
‘ff 


Block level 


ff 
deSDL 

nodename <ime_cvora> 
deSDLend 

af 


Fig.2 — SDL comments 


VI. SIMPLE EXAMPLE 


This is a simple example showing distributed Erlang 
implementation of SDL specification with comments that 
indicate nodes on which processes run. 


SDL specification: 


Block zbr; 

/* 

deSDL 

nodename nodel@kupa 

deSDLend 

a/ 

Ssignalroute rl from zbr to env with rez; 
from env to zbr with plusreq; 

process zbr referenced; 

connect ch2 and rl; 

endblock zbr; 


Process zbr; 
dcl X,Y,Z Integer; 
start; 

nextstate idle; 
state idle; 
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input plusreq(&, ¥)s 
task Z:=X+¥;7 
output rez (2): 
mextstate idle; 
emistate; 


emiprocess zbrz 


Eris =p Jemmentscion. ~ 


—modnle (zbr) - 
—export ([start/0,offspring/1, 
get welue/i,start zbr/1]) - 


= 

= This modul showld be stored 
= om Erlang node nodel@kupa 

= 


start | zbr (E> —_—> 
Offspring=spewun (model @kupa, offspring, 
control, fmall,self()]), 

X=spawn (noedel @kupe, offspring, 

war walue, [mull,self()]), 
¥=spamn (nodel Gkupa, offspring, 

war welue, [null,self()]) 
Z=spewn (model @kupea. offspring, 

war A ya {mmil,self()]), 
L=[(Offspring,X 2 ¥, Zz} . 


idle zbr( (Offspring, xX, Y,2Z])-—> 


L=[Offspring,X,Y,Z], 


{set,get_walue (X)+get_wealue(¥) }, 
et  sucelje, node2 
(rez, get walue (Z) Be 
idle zbr() = 
Other—> 
idle zbr(L) 


VIL CONCLUSION 


In this paper we have presented automatic translation from 


SDL specification t distributed Erlang implementation 
Generated Eriamg code can be run immediately, without 
corrections. Local features were tested by two examples: 


node2@kupa} 


local cal] and call waiting service, implemented as node- 
based service. 

Disiributed features were tested while implementing sample 
system for mathematical operations, when every operation 
was performed on different station, while one station took 
the role of coordimator and interface with user. 

Transistor program is written im C programming language, 
with some parts m C++. Program is implemented on 
SunOS/Solaris Unix platform in X Windows environment 
For SDL specification Telelogic’s SDT tool is used. 
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Telephone Traffic Routing Simulation Using 
Learning Automata 


Mladen Tkali¢, Miljenko Mikuc, Hrvoje Pavici¢ 


Abstract-- Modern fast switching networks are being forced to 
redesign the routing schemes that are used in more traditional 
networks. Recent advances in technology are making more 

_ sophisticated and faster control systems possible. One solution 
is the application of learning automata. Due to these changes, 
a program model of LAG (Learning Automata Games) 
workstation has been under development at the 
Telecommunication Department — Faculty of Electrical 
Engineering and Computing in Zagreb for a number of years. 
The one module of the LAG workstation described in this 
paper is simulating traffic routing in channel switching 
telecommunication networks. 


Index Terms-- learning automata, telecommunication 
network, routing 


I. INTRODUCTION 


The problem of telephone traffic routing has existed since 
the time the first telephone networks were developed. The 
conventional methods used to route telephone traffic are 
based on simple fixed rules defined in each routing table at 
every node in the network. These rules are designed based 
on a prediction or estimation of the actual traffic that will 
occur in a particular network. If the estimations are good, 
fixed rule systems work satisfactorily. Due to its simplicity, 
these fixed routing strategies are the most widespread 
schemes in the existing networks. Changes in technology 
and network sizes have forced us to reconsider this strategy. 
This involves the usage of stochastic learning automata, one 
of the most important tools of learning control 


(2}{3]{4)[12]. 


II. LAG (LEARNING AUTOMATA GAMES) 


In order to explore and take advantage of the learning 
automata we have developed LAG workstation. It is a 
simulation environment for experimenting and comparing 
different properties of learning automata at various network 
states [7][9]. It is implemented on the SUN computers and 
SUN OS operation system. XView Toolkit containing 
OPENLOOK GUI interface provides a user with simple, 
efficient and steady communication. 


M. Tkali¢ is professor at the Faculty of Electrical Engineering and 
Computing, Telecommunication Department, University of Zagreb. 

M. Mikuc (miljenko@tel.fer.hr) is assistant professor at the same 
Department. : 
H. Paviti¢ (hpavicic@pbz.hr), received M.Sc. degree in 1999. in learning 
automata theory. He is with Privredna banka Zagreb, Network department. 


After creating the basic LAG architecture, the requirement 
for more flexible user communication interface was 
imposed. It has been achieved by the open way of the 
connection between modules. Such organization of program 
modules makes them open for further extensions. Programs 
have been written in C and C++ program languages. 
Modules can be divided in two groups. The first one is 
made of INSTRUCTIONS, TOPOLOGY CHOICE and 
DIAGRAM FLOWS modules and their purpose is 
acquainting with the whole workstation concept and ways 
of its usage. The second one is made of AUTOMATA 
GAMES module which presents simulation of the learning 
in the random stationary environment, and SELF- 
LEARNING AUTOMATA TESTING (SAT) module 
which is the simulation of the network operation using a 
various traffic routing methods [8][10]. 

The latest designed module was object-oriented module 
named ROUTING which has been written in MS Visual 
C++ and implemented on the computers running Windows 
operation system. It was designed to simulate any telephone 
traffic routing problem in telephone channel switched 
network using a variety of static and dynamic routing 
methods based on learning automata [7][10]. Routing 
module is builded on the basic principles of SEATER, 
program developed by Zgierski and Oommen [1]. 


A. Laboratory Design 


We have organized our Automata Theory Laboratory 
around Unix, SunOS based workstations and PC platform, 
Windows NT. Currently, routing modules are implemented 
only on NT, and some modules are running only on Unix. 


NT server 


Fig. 1 - Automata Theory Lab 


The most important goal is to provide transparent results on 
all computers and build network knowledge database based 
on simulation results. 
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B. Learning Automata Concept 
The concept of learning automata was introduced in the 
early 1960°s. A learning automaton consists of automata 
having a finite set of actions connected in a feedback 
configuration to a random environment. The automata 
perform an action into the environment and update its 
strategies on the basis of the random response of the 
environment. 


ENVIRONMENT 
2 


AUTOMATA 
® 


Fig. 2 — Learning automata 


The learning process implies the recognition of optimal 
actions. Hopefully, the automata will eventually choose 
these optimal actions more frequently than the other ones 


[6}{2}{5}{4}- 


III. ROUTING MODULE 


In this chapter we will describe the process of creating 

telecommunication network model, simulating traffic 

routing in it and presentation of results. Routing module 

allows the user to graphically specify the topology of the 

telecommunication telephone network. 

Process of designing the network is: 

** defining the topology graph (nodes and links), 

** defining the call groups, 

** defining the senders (number and routing schemes), 

** defining the main simulation values (number of 
experiments, number of time units in experiment, 
impulse cost, etc.). 


A. Topology graph 
The first step in the process of creating a network model is 
defining a topology graph, which contains nodes and two- 
way links (Trunk groups). Each node represents a location 
or call switching center and they are interconnected by one 
or more two-way links. 


=~ Untitled - Routing Pale! ES 


Fie Edt View Select Help 


Sisiala) le|fovieial AAS el ee] 


Fig. 3 — Defining a topology graph 


The positioning of nodes in the drawing area is 
accomplished by selecting the insert and node drawing tools 
and clicking the mouse button at a desired place. 


Fig. 4 — Creating a link 


The capacity of the link we denote by |, which indicates 
the number of trunk lines available to this link. The link 
capacities of the entire network can be described by L, 
which is a matrix whose individual elements are |. If we 
select the insert and trunk group drawing tools and click on 
two nodes, the link is created and drawn, and the program 
expects the value of capacity to be entered. 


B. Traffic 
After we create the topology graph we have to create call 
groups which represents positions of traffic input and 
output points. Every call group contains the following 
values: input node, output node, input traffic distribution, 
holding time distribution and tariff. 


Fig. 5 — Creating a call group 


We create call group by selecting insert and the call group 
drawing tools and clicking the mouse button on the traffic 
input node. 


C. Senders — The Routing and Control Structures 


At each node, entities (which are typically algorithms) 
called senders process a calls and send it along the one of 
trunk lines inside the link. Routing algorithms available for 
routing processes in every sender are: Fixed Roule, Random 
Routing, M-Automaton, Lrp (Linear Reward Penalty 
Automaton), Lr.; (Linear Reward Inaction Automaton), L).p 
(Linear Inaction Penalty Automaton), DLp-p (Discrete 
Linear Reward Penalty Automaton), DLp.; (Discrete Linear 
Reward Inaction Automaton), DL;.p (Discrete Linear 
Inaction Penalty Automaton), ADLp.p (Absorbing Discrete 
Linear Reward Penalty Automaton), ADL).p (Absorbing 
Discrete Linear Inaction Penalty Automaton) and S-type 
automaton for all automaton types. 


A, — intensity of calls of call group from node i to node j 
Six — Sender of call group (i, j) in node k 
ijk — Queue of call group (i, j) in node k 


Fig. 6 — Sender role in the node 


Every node for routing calls from all call groups has to 
contain senders for every call group. One sender can serve 
only one call at a time, so if we want speed up serving of 
calls we need more senders of the congested call group in 
the node. 


D. Simulation results 


Due to the stochastic nature of the process, the results of 
each experiment have been made on the basis of 300 
independent experiments. Each experiment was carried out 
for a total time of 1000 time units. 


Call Probabilities Link Capacities 
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Fig. 7 — Graphical presentation of blocking probability 


We can see from fig.9 that results are calculated for many 
different parameters. All of them are available in graphic 
and table format. They are divided into two main groups: 
call group parameters and link parameters. For call group 
parameters simulation calculate traffic (blocking) and 
money (gain, cost) values for each node. 

Simulation results clearly demonstrate that ergodic learning 
automata solutions are far superior to the traditional fixed 
rule solutions because of the fact that they can adapt to 
changes in telephone traffic patterns. Also, S-type automata 
outperformed their non S-type conterparts in every 
environment. 
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Fig. 8 — Graphical presentation of lost calls distribution 


TV. CONCLUSION 


The primary subject of this paper is to present the benefits 
of using simulation environment in designing channel 
switched telecommunication networks, which require a 
problem specific analytical phase. LAG workstation is a 
software tool for network planning and design, developed to 
enable quantitative verification of alternative strategies. It 
was implemented in Automata Theory Lab as platform for 
testing of models and methods gained from the theory of 
learning automata games. LAG workstation enables testing 
of various network topologies and different routing 
methods. As computer technology continues to advance, 
simulation will be used to evaluate more complicated 
network mechanisms. Designing a simulation environment 
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for packet switched telecommunication network is our next 
step. 
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Fig. 9 — Results for one routing algorithm 
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Linear Network Analyser Design with 
Heterogeneous Component Composition 


Bruno Blaskovi¢, Petar Knezevi¢, Mirko Randi¢ 


Abstract— This paper describes the development of lin- 

ear network analyser software. The software development is 
based on coordination between heterogenous software com- 
ponents within (CO)*PE metamodelling environment. 
As solution, the language for component coordination is in- 
troduced. Coordination formalism is based on high level 
Petri nets providing great flexibility and reusability during 
component composition. 


Keywords— metamodelling, coordination, linear network 
analysis 


I. INTRODUCTION 


UR aim is to develop the methodology for tool de- 
sign and exercise it through linear network analyzer 
design. We treat the following three points: 
- the design of distributed system with coordination based 
model 
- heterogenous components composition 
- linear network analyzer design 
First, requirements for linear network analysis are intro- 
duced. The requirements are tailored to meet the needs 
for educational purposes. The network topology consists 
of RLCM elements [1] with AC or DC voltage/current 
sources. 
Further on, the language for network description is intro- 
duced. The language syntax follows LISP-like syntax im- 
plemented in CLIPS expert system shell, while semantic 
issues can be interpreted through CLIPS functionality. 
After Introduction, Scope and Motivations for such ap- 
proach are described. Next, the model of coordination is 
derived from (CO)*PE metamodelling framework. At the 
end, small example is given together with BNF notation 
for language syntax. 


II. SCOPE AND MOTIVATION 


Consider the simple example from fig. 1. Given the net- 
work topology as well as the element values, experienced 
user can easily calculate unknown currents and voltages. 
But to become experienced user some education efforts are 
needed. In order to acquire such skills one must undertake 
the courses with names like Introduction to Electrical Cir- 
cutts, Electrical Network Analysis or Fundamentals Elec- 
trotechnic. 

Our scope of LInear Network Analyser software (LINA) 
design is primary the usage during educational process. 
Besides pencil and paper such networks can be solved by 
the means of computer programs. 


Bruno Blaskovié, Petar Knezevié Mirko Randié are with the Faculty 
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mail: bruno.blaskovic,petar.knezevic,mirko.randic@fer.hr . 


Fig. 1. DC network example 


If languages like C or FORTRAN are used student will spent 
most of time compiling coding and finding why the program 
is not doing what is supposed to do. Another approach is 
to take some commercially available tools like Mathemat- 
ica ©, MathCAD ©, Mathlab ©. They are very suitable 
to general problems solving. Besides that, each of them 
has libraries with predefined problem solutions. 

Our approach is slightly different. In stead of selecting one 
of the tool and trying to use it for educational process, the 
set of tools will be used. 

There are several components necessary to meet desired 
functionality of the tool. Each of this component is stand 
alone tool, set of subroutines or specialized language: 

- language parser: LEX/YACC tool in Perl implementa- 
tion 

- equation solver: set of C routines implemented in octave 
package 

- graph plotter: GNUplot 

- graphic user interface: TclTk interpreter 

- command interpreter: CLIPS or Perl routines 


Coordination between the components is formally modelled 
as high level Petri Net. Each transition in high level Petri 
net is a component, while states are modeling component 
preconditions. Preconditions are necessary for data flows 
verification between components. For example, equation 
solver must have n equations with n variables. If that is 
not true verification will fail because the precondition fails. 
User interface is introduced as language. Each command 
in the language is calling components. Network topology, 
components and desired actions taken to solve the network 
are all defined within the language. 

Benefits of such approaches are: 

- heterogenous components with different paradigms are 
mixed together in consistent way 

- high level Petri Net model analysis is suitable for require- 
ment validation through model checking 

- rapid prototype development 
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- previous experience from model development during pro- 
tocol synthesis is used 


A. Previous work 


During intelligent network protocol synthesis the mod- 

elling framework is [2] introduced. Elementary Commu- 
nicating Processes were components of the protocol. The 
process of synthesis consists of connecting components into 
the protocol regarding requirements and constraints. The 
whole process is heavily model checked against possible 
protocol errors. 
As a generalisation, metamodelling framework (CO)*PE 
is developed. In our approach network analyser is build 
from components (named Elementary Communicating Pro- 
cesses) into the network analysator. 


B. Related work 


UML from OMG is powerful system [3] suitable for mod- 
elling and design of software systems. Its relation with 
OMG group CORBA [4] makes it the natural candidate 
for network application design. 

More formal approach is given in [5] regarding the composi- 
tion of components, while [6] and [7] are platforms suitable 
for development like ours. 


C. Coordination models 


Coordination models are formalism for design of concur- 
rent systems [8]. 
Coordination languages compose the a system from the 
components. Although our network analysator is not pure 
distributed software coordinated paradigm is find very suit- 
able for software development. 
Previously developed, mostly heterogenous software pieces 
are connected together. The general idea, described in the 
next section in more details is software reusability in coor- 
dinated environment. In our case coordination formalism 
is exercised through high ievel Petri nets and used for the 
composition of the system from components. 
If the software system is to be developed from the begin- 
ning, developers are always oriented to software libraries. 
On the other side, libraries are always more or less plat- 
form dependent, thus requiring efforts to adjust them for 
particular application. 
With coordination languages different paradigms are mixed 
together. 
Natural candidate for different languages as well as 
paradigms interworking is Perl extension mechanism [9]. 
In order to design coordinated language based system in 
consistent way, model is described in the next section. 


III. 


As mentioned in the previous sections, (CO)4PE meta- 
model serves as the basis for our case based model. 
The purpose of the model is to define reliable environment 
for linear network analysis software design. 


THE MODEL 


The first step is to define 3-level model. After that, lan- 
guage syntax as well as semantics is introduced. 


A. (CO)*PE metamodel 


In [2] (CO)*PE 3-level metamodel is introduced. The 
levels are abstract, operational and implementation level. 
Roughly speaking, abstract model is the place where re- 
quirements are introduced and analysed. 

The most important is to define the set of Elementary 
Communicating Processes (ECP) and how are ECP com- 
posed into the system. Another important thing is the def- 
inition of function each ECP must perform. 

Operational model can be viewed as specification level. On 
this level the algorithms or processes for each ECP are in- 
troduced. Further on, the message flows between EC P’s 
are introduced. The composition of ECP’s is achieved 
through the specialised language introduced in this level. 
The set of high level Petri nets describes the language se- 
mantics. 

Implementation model introduces the mechanism for co- 
ordination of EC P’s through syntax and semantics of the 
language. Here are data structures and command’s syntax 
defined. The set of high level Petri nets from operational 
level are implemented thus providing command execution. 
We shall not describe in depth the theory that is be 
hind (CO)4PE metamodel. Each level consist of: set 
of ECP’s and their composition composECP. Re 
garding the level, composECP is called semantic or 
abstract/operational/implementational net or structure. 
ECP is triple: 


ECP (data, proccess, semantic) 


where: 
data information flows visible outside ECP 
process proces, function ECP handles 


semantic semantic of ECP 


On all levels composECP or nets are triples: 


composEC' P(ingo_flows, process_pairs, semantic) 
where: 


info_flows information flows between 
all ECP’s 

process pairs set of ECP connected pairs 

semantic net of EC’P’s describing 


system’s functionality 


The model for linear network analysator can be described 
as the set of three net triples with three sets of ECP triples. 


B. Abstract model 


(CO)*PE is metamodelling framework. So, not all ele- 
ments has equal importance. The most important thing in 
abstract model is to identify the set of cooperating func- 
tions or processes as well as the base architecture for their 
interworking. The set of functions or processes which will 
be considered as candidates for EC P’s definition is: 


1. kernel which will call other functions, 
2. functions for numeric calculations 
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3. functions for numeric network analysis 

4. functions for interface with symbolic solver 

5. functions for visualisation 

6. functions for ploting functions 

Information flow between them will be consider after the 
selection of existing and avaliable software modules on op- 
erational level. 

Each group of function will be packed into an ECP. So to- 
tal of seven EC'P’s is introduced in this phase. Two kinds 
of semantic are identified regarding abstract net: 

- static semantic (shown on fig. 2), for 3 ECP with kernel 
(shell), 

- dynamic semantic. 


absECP_num 
(calc) 


absEC P_vis—out 
(plot) 


absECP_shell 
(command) 


absECP_vis—in 
(draw) 


Fig. 2. static semantic in the abstract model 


Dynamic semantic can be described as set of sequences 
like this: kernel -> 2, —> kernel—> 22. -—>..., 
where 21, 2%2,... are EC'P’s other than kernell. 


C. Operational model 


In operational model, the first step is to define algorithms 
as the solutions for functions introduced in abstract model. 
For that purpose existing software packages are used like 
perl [9] for programming the kernell (shell) and command 
parsing, octave [10] for numerical calculations, gnuplot [11] 
for plotting the functions, TclTk [12] for graphical input, 
and CLIPS [13] for interface to symbolic functions, TclTk 
[12] for graphical input, and CLIPS for interface to sym- 
bolic functions. 

Algorithms are then developed using pseudo code. Graph 
algorithms necessary for development are taken from [14]. 
The semantic part of ECP definition is build into the cho- 
sen software packeges. 

Operational net defines the language for coordination. 
Information_flows are language data structures, introduced 
in BNF notation. process_pairs are defined through the set 
of language commands, while command semantics is de- 
fined through the set of high level Petri nets. 


D. Implementation model 


With operational model defined, implementation model 
is just coding of the specification. ECP functions 


are implemented as scripts that are specified in op- 
erational level so command can, for example, juct 
call octave filewithscript arg. Language syntax is 
the part of Implementation net. The semantics of 
Implementation net is executable high level Petri net in- 
troduced in operational level. 
Input language follows sintax rule from NASA CLIPS ex- 
pert shell [13]. The language is implicit coordination lan- 
guage because there are no explicit commands for coordi- 
nation. The most significant parts of BNF’s are given in 
the APPENDIX. The samantics of the language is defined 
through high level Petri nets introduced in the operational 
level. As an example on fig. 3 typical command execution 
is presented. Command is parsed in lex — yacc transition 
(the part of kernel ECP) and then interpreted regarding 
all system, internal and other data in system tables. Er- 
rors are supported in error part of kernell ECP. After 
that calculi is invoked in EOP supporting numerical anal- 
ysis, or solve in ECP supporting symbolic analysis. 

The whole modelling efforts can be summarised as consis- 


LEX-YACC | enor 


Fig. 3. LINA command semantic as high level Petri Net 


tent data acquisition for calling problem solving software. 
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IV. AN EXAMPLE 


Linear DC network from fig. 1 serves as an example to 
illustrate the possibilities of network analyzator. After en- 
tering the interactive mode, the network is load with the 
following command: 

(load "3AB.1kontrolna_zadaca_0E1.LINA") 

The content of the file is checked with: 

(print net) 

producing the following listing: 

(defNET test "MEleCON2000 example" 

(Ri 1 3) (R3 200) (R5 2 3) 

(NODEa Ri (UDC 20) NODEb) 

(NODEb GND) 

(NODEa (R2 10 3) NODEc) 

(NODEc (R4 20) NODEb) 

(set BR5 NODEc NODEb) (BR5 R5) 

(NODEc BR5 NODEb) ) 

(R1 1 3) means R, = 1kN and (R3 200) is for Rs = 2002. 
(NODEa (R2 10 3) NODEc) is Ro = 10k between the 
nodes a and b. The branch from node c to node b con- 
sists of single Rs = 2k. 

Now, some commands are typed: 

(set U3 NODEa NODEc) 

(calc U3) 

(plot U3 R5) 

First U3 voltage is introduced as U,-. The value is calcu- 
lated and graph plotting of U3 initiated. Range variables 
are taken from system’s defaults. 

At the and, the whole sequence is saved into the file for 
future reference: 

(save "Melecon2000.dc.test") 


V. CONCLUSIONS AND FURTHER STUDY 


An implicit coordination language for linear network 
analysis is introduced. Such approach minimize developm- 
net cycle as much parts of the system are reused software. 
(CO)4 PE metamodel approach is used for software devel- 
opment, thus allowing great flexibility during the develop- 
ment. 

Future work will concentrate on (1) (CO)*PE metamodel 
development and (2) adding parallel routines into the anal- 
yser development. 
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APPENDIX: BNF synTAx FoR LINA LANGUAGE 


<program> ::= <tuple>* | 
<tab> | <space> | <newline> 
<tuple> ::= <element> | 
<command> | 
<constr> | 
<comment > 
<element> ::= <r> | <l1> | <c> | <m | 
<xl> | <xc | <bl> | <be> | 
<xm> | <m | 
<z> | <y> | <g> | 
<v> | <a> | <w> | <ohm | 
<udce> | <uac | 
<idce> | <iac> | 
<gnd> | <sc> | <oc> | 
<node> | <branch> | 
<u> | <i> 
<x> ::= R<lexem | 
*(? Rélexem> <fl> ’)’ | 
7(? Ré<lexem> <fl> <fl> ’)’ | 
*(? Rélexem> <fl> <fl> ’)’ 


<xl> ::= XL<lexem> | 
»(’? XL<lexem> <fl> ’)’ | 
7(? XL<lexem> <fl> <fl> ’)’ | 
*(? XL<lexem> <fl> <fl> ’)’ 


<node> ::@ NODE<lexem> | 
*(? NODE<lexem> <lexem>+ 
<satring>* ’)’ | 


<command> ::= <set> | 
<draw> | <plot> | 
<solve> | <calc> | 
<print> | <list> | 
<load> | <save> 


<cale> ::= 7(’ cale <string> ’)’ 

<constr> ::= <defnet> | 
<defdefault> | 
<defacript> | 
<defmacro> 


<defnet> ::= ’(’ defNET <lexem> <string> 
<string> <branch>+ ’)’ 
<branch> ::= ’(’ BR<lexem> <element>+ ’)’ | 
*(? NODE<lexem> <element>+ 
NODE<lexem> ’)’ | 
’(? NODE<lexem> BR<lexem> 
NODE<lexem> ’)’ 
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Performance Tuning of Large-Scale Distributed 
WWW Caches 


Sinisa Srbljic, Andro Milanovic and Nikola Hadjina 


Abstract — World Wide Web (WWW) caches, such as the 
Harvest cache, its successor Squid, and a cache proposed by 
Malpani, Lorch, and Berger from the University of California, 
Berkeley, (referred to hereafter as the Berkeley cache) 
implement proxy to proxy communication. This 
communication mechanism unifies the communicating proxy 
caches and makes them perform as a single cache that is 
distributed over multiple proxy machines. 


In this paper, we investigate how the size of the 
distributed cache affects its performance. We compare the 
performance results for two different distributed caches: 
Squid and Berkeley. We show how to choose the number of 
proxy machines in a distributed cache in order to avoid 
network congestion and prevent overloading of the proxy 
machines. 


Index Terms — Distributed WWW cache, distributed cache 
management protocol, performance tuning. 


I. INTRODUCTION 


World Wide Web (WWW) proxy caches are 
introduced to avoid the bottlenecks of the classical 
client/server Internet architecture. However, WWW proxy 
caches do not improve the performance of distributed 
systems as much as private caches improve the performance 
of multiprocessor systems. A low cache hit rate is one of 
the main reasons for the poor performance of WWW 
caches. It is well known that multiprocessor caches have hit 
rates over 0.998, while the hit rates of the WWW proxy 
caches are not much over 0.30. Experience has shown that 
connecting a larger number of clients on a proxy machine 
can increase the cache hit rate. However, the number of the 
clients per proxy machine is limited. Therefore, the Harvest 
cache [1] introduces the idea that proxy machines should 
communicate in order to increase the cache hit rate. Before 
sending the request to the original server, the proxy 
machine should check if the copy of the requested object 
exists at nearby proxy machines. The proxy-to-proxy 
communication distributes the WWW cache over multiple 
proxy machines that cooperate in order to return the 
requested object. Since we can increase the number of the 
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proxy machines and the number of clients in a distributed 
cache, this can increase the cache hit rate. 


There are three known solutions for a distributed cache 
management protocol (DCM protocol). The Harvest cache 
[1] and its successor the Squid cache [2] are based on a 
hierarchy of communicating proxy machines. Alternative 
approaches to the design of the distributed proxy cache are 
described in [3]. We will denote this protocol as the 
Berkeley protocol. While details of DCM protocols can be 
found in [1]-[3], we only describe the basic algorithms of 
the DCM protocols in this paper. 


Since the main design issue is scalability, in this paper 
we present how to determine the number of proxy host 
machines in order to tune the performance of the distributed 
WWW cache. In Section 2 we describe, evaluate and 
compare current DCM protocols. Section 3 introduces the 
graphs used to present results of the performance tuning 
process. Section 4 discusses the results of the tuning 
process for different system parameters and for different 
application parameters. Final comments on the results of the 
performance tuning are given in the last section. 


I]. PERFORMANCE EVALUATION AND COMPARISON OF 
DCM PROTOCOLS 


A DCM protocol consists of five major algorithms: the 
admission algorithm, the search algorithm, the decision 
algorithm, the replication algorithm, and the termination 
algorithm. Since the cache is distributed across multiple 
proxy machines, the admission algorithm must decide to 
which cache the client should make the first request. We 
will denote the proxy machine that runs the chosen cache as 
the master proxy to be consistent with terminology in [2]. 
When the proxy machine receives a request, and the local 
cache does not have a copy of the requested data, the DCM 
protocol must start its search algorithm in order to find a 
valid copy in one of the other cooperating proxy machine 
caches. We will call these cooperating proxy machines 
neighbors. After the DCM protocol finds out where the 
valid copies of the requested object exist, it must decide 
which neighbor proxy machine will return the copy. 
Finally, the replication algorithm determines if the copy 
should be cached and replicated at the master proxy 
machine. If there is no copy in the distributed cache, the 
request for the object is sent to the original server. The 
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termination algorithm decides when to stop the search at 
the neighbor proxy machines and send the request to the 
original server. 


In this paper, we present how three main algorithms 
affect the performance of the distributed cache: the search 
algorithm, the replication algorithm, and the termination 
algorithm. The Squid protocol and the Harvest protocol use 
the same algorithms, and therefore, we present only the 
results for the Squid protocol. The search algorithm is based 
on broadcasts: the master cache broadcasts the request 
message to all neighbor caches one by one. The neighbor 
caches respond either with a it or a miss, depending on 
whether they have the requested object or not. On the first 
hit message, the master cache issues a gef request to that 
neighbor cache. The replication algorithm stores a copy of 
the requested object at the master cache. The termination 
algorithm counts the number of miss responses. If all 
neighbors respond with a miss, or the preset timeout period 
is over, the request for data is then sent to the origin server. 


In order to reduce the network traffic, the Berkeley 
protocol [4] uses IP multicasts. During an IP multicast, the 
master cache sends only one UDP packet, and not multiple 
packets as in case of the broadcast. To additionally reduce 
the network traffic, the master cache does not replicate the 
requested object in the case of a neighbor cache hit and 
neighbor caches that do not have the requested object do 
not send back miss responses. The termination algorithm is 
based only on the preset timeout period. These features 
additionally reduce the network traffic, because the 
requested object is sent only once through the network (in 
contrast, the Squid protocol sends the requested object 
twice through the network, once from the neighbor cache to 
the master cache, and once from the master cache to the 
client) and there are no miss responses from neighbors that 
do not have the requested cbject. 


Berkeley 
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Av. latency per request (s) 
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Figure 1: Performance comparison 
of Squid and Berkeley cache 


As a performance measure, we chose the average 
latency per request. Figure 1 presents the average latency 
per request for two different DCM protocols’: Berkeley and 
Squid. The results are generated by an analytical model 
developed in [5] and compared to the results of a large- 
scale network simulator developed in [6]. 


For both decreasing and increasing numbers of proxy 
machines, the distributed cache saturates. For a decreasing 
number of proxy machines, the proxy machines are 
overloaded and they are not able to handle all the input 
requests. However, increasing the number of proxy 
machines also increases the amount of proxy-to-proxy 
communication, which causes network congestion. 


As we can see from the Figure 1, the Berkeley protocol 
has a much wider interval for which it does not saturate 
(between 6 and 265 proxy hosts) than the Squid protocol 
(between 10 and 32 proxy hosts). Since the Berkeley 
protocol is based on an IP multicast and does not use miss 
responses and object replication, it generates lower traffic 
and lower proxy load, which increases the non-saturation 
interval. Although the Squid protocol generates higher 
traffic and a higher proxy load, the miss responses and 
object replication significantly reduce the latency. While 
the miss responses help stop the searching even before the 
preset timeout period expires, data replication increases the 
master cache hit rate. Both non-saturation intervals show 
that the latency does not change significantly while the 
DCM protocol does not saturate. Therefore, it is important 
to determine the non-saturation intervals for different values 
of system and application parameters in order to tune the 
performance of the distributed cache. 


' Figure | presents the performance results for the 
following distributed cache configuration: the total number of 
proxy hosts, NV, in the distributed cache ranges from 2 to 300, the 
average cross-sectional bandwidth of the network is 1024 Mb/s, 
the average bandwidth of a proxy host is 100 Mb/s, the total 
number of client hosts is 9100, the average number of requests per 
client per second is 0.53, the average bandwidth ofa client is 10 
Mb/s, the average size of the object is 10 kB, the total number of 
server hosts per distributed cache is 3, the average bandwidth of a 
server host is 100 Mb/s, the number of subnets that connect the 
proxy hosts D increases with the number of proxy machines 
D=max(1, 0.2xN), the timeout period used by termination 
algorithm is 40 milliseconds, the IP packets are of three typical 
sizes (40 bytes, 180 bytes, and 1024 bytes), the master cache hit 
rate is 0.25, and the neighbor cache hit rate is 0.30. We also use a 
number of other DCM protocol and TCP/IP parameters [5] that are 
not important for the discussion presented in this paper. 
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III]. GRAPHICAL PRESENTATION OF PERFORMANCE TUNING 
PROCESS 


Figure 2 gives an example of the graphs used to present 
the results of the tuning process. For a particular DCM 
protocol, the graph shows the non-saturation intervals for 
different values of a system or an application parameter. 
The non-saturation intervals for different values of the 
system or application parameter define the non-saturation 
area. Hereafter the non-saturation interval is presented as 
triple: (system or application parameter value, lower non- 
saturation interval bound, upper non-saturation interval 
bound). 


To the left of the non-saturation area is the overloaded 
proxies saturation area. For a given number of proxy host 
machines and for a given value of the system or application 
parameter, the proxy machines are overloaded, which 
causes the DCM protocol to saturate. To the right hand of 
the non-saturation area is the congested network saturation 
area. For a given number of proxy host machines and for 
given values of the system or application parameter, the 
communication network is congested, which also causes the 
DCM protocol to saturate. 


Next section presents the results of the tuning process 
for two different system parameters (network bandwidth 
and proxy host bandwidth) and for two application 
parameters (number of clients and average object size). The 
results are given for both the Berkeley protocol and the 
Squid protocol. 


System or 


application 
parameter Overloaded Congested 
values Proxies Non- Network 
Saturation | saturation Saturation 


Area Area 


area 


Number 
of Proxies 


Figure 2: Saturation and non-saturation areas 


IV. PERFORMANCE TUNING FOR DIFFERENT SYSTEM AND 
APPLICATION PARAMETER VALUES 


Figures 3a-b present the non-saturation areas for four 
system and application parameters: decreasing cross- 
sectional network bandwidth, decreasing proxy host 
bandwidth, increasing number of the clients, and increasing 
average size of the object. 


As we can see from figure 3a, the decrease in the cross- 
sectional network bandwidth does not affect the lower 
bound of the non-saturation area, while the upper bound 
decreases. For the Berkeley protocol, the typical values for 
the given distributed cache configuration are: (1024 Mb/s, 
6, 265), (800 Mb/s, 6, 125), (625 Mb/s, 6, 16), and (611 
Mb/s,-,-), where the first value is cross-sectional network 
bandwidth and other two values are the lower and the upper 
bound in the number of proxy host machines, respectively. 
Once the cross-sectional network bandwidth drops to the 
value of 611 Mb/s, it is not possible to choose the number 
of proxy hosts and to avoid the saturation of the Berkeley 
protocol. The typical values for the Squid protocol are: 
(1024 Mb/s, 10, 32), (900 Mb/s, 10, 20), (850 Mb/s, 10, 
15), and (825 Mb/s,-,-). 


As we expect, if the proxy host bandwidth decreases 
(see Figure 35), the upper bound does not change, while the 
lower bound increases. The typical values for the Berkeley 
protocol are: (100 Mb/s, 6, 265), (40 Mb/s, 17, 265), (10 
Mb/s, 157, 265), and (9 Mb/s,-,-), where the first value is 
the proxy host bandwidth and the two other values are the 
non-saturation interval bounds. The typical values for the 
Squid protocol are: (100 Mb/s, 10, 32), (60 Mb/s, 20, 32), 
(50 Mb/s, 27, 32), and (47 Mb/s,-,-). 


The increasing number of clients changes both the 
lower and the upper bound of non-saturation interval: the 
lower bound increases and the upper bound decreases. The 
typical values in Figure 3c are: (9100 clients, 6, 265), 
(12100 clients, 9, 107), (13100 clients, 10, 70), and (14840 
clients, 11, 11) for the Berkeley protocol and (9100 clients, 
10, 32), (9700 clients, 11, 26), (11100 clients, 13, 15), and 
(11124 clients, 14, 14) for the Squid protocol. The first 
value of the triple denotes the total number of the clients per 
distributed cache. 


Figure 3d shows that both the lower bound and the 
upper bound change if the average size of the object 
increases. The typical values are: (10 KB, 6, 265), (13 KB, 
8, 159), (16 KB, 10, 53), and (16.61 KB, 11, 11) for the 
Berkeley protocol and (10 KB, 10, 32), (12 KB, 12, 19), 
(12.6 KB, 12, 15), and (12.77 KB, 13, 13) for the Squid 
protocol. The first value of the triple denotes the average 
size of the object. 
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Figure 3: Non-saturation areas for different system/application parameters 


V. CONCLUSION 


In order to tune the performance of a large-scale 
distributed WWW cache, we should determine the number 
of proxy host machines for which the distributed cache does 
not saturate. The protocol that manages the distributed 
cache has a major effect on the performance tuning process. 
The Berkeley DCM protocol does not saturate over a wide 
range of numbers of proxy machines, which simplifies the 
performance tuning process. On the other hand, the Squid 
DCM protocol does not saturate over a lower range of 
numbers of proxy machines, but incurs lower latency. 
Therefore, the performance of a WWW cache may be 
improved if the Squid DCM protocol is used, but the 
number of proxy host machines should be precisely 
determined. 


In addition, we show how four major parameters affect 
the performance tuning process: cross-sectional network 
bandwidth, proxy host bandwidth, number of the clients, 
and average size of the object. If the cross-sectional 
network bandwidth decreases and all other parameters 
remain the same, the number of proxy machines should be 
decreased in order to reduce proxy-to-proxy 
communication. If the proxy host bandwidth decreases, the 
number of proxy machines should be increased in order to 
reduce the proxy load. The increase in the number of clients 
or the increase in average size of objects affects both the 
upper and the lower bound of the number of proxy host 
machines: the lower bound should be increased and the 
lower bound should be decreased. Although we assume that 
the cross-sectional network bandwidth is high (1024 Mb/s) 
and that the proxy host bandwidth is moderate (100 Mb/s), 
the increase in the number of clients or the average object 
size affects the upper bound more than it affects the lower 
bound. Therefore, the performance of distributed cache is 
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much more sensitive to the proxy-to-proxy communication 
than to the proxy load. 
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Moore Reserves for Orthogonal Networks. 
A Case Study 


Cristian Lupu and Andrei Hagiescu 


Abstract—We consider dividing interconnection spatial 
locality of a direct interconnection network into structural and 
functional localities. For structural locality is proposed a 
simple analytical measure based on Moore's bound: Moore 
reserve [8]. For direct interconnection networks with same 
number of nodes, it formulates a topological design/evaluation 
principle: the Moore reserve is bigger, characteristics of 
structural locality are poorer. In the case study are presented 
results of evaluation of hypercubes and hypertori with 
maximum 10 dimensions and 50 nodes per dimension. This 
means tables and graphs together with comments and 
conclusions. 

Index Terms—Interconnection Spatial Locality, Structural/ 
Functional Localities, Direct Interconnection Networks, 
Topological Design/Evaluation, Hypercubes, Hypertori 


I. DEFINITION OF MOORE RESERVE 


Between the most important quantitative topological 
principles to design/evaluate networks is _ the 
interconnection locality [2, 3, 4, 5, 6, 12]: Interconnection 
spatial locality divides into structural and functional 
localities [8]. To evaluate the structural locality of a direct 
interconnection network (DIN), besides the neighborhoods 
[7], we propose a simple measure: the Moore reserve based 
on the Moore's bound [8]. As knows, the Moore bound is 
given as the maximum number of nodes which can be 
present in a graph of given degree / and diameter D: Nujoore= 


=, )0-1?-1 : 
=1+/ oe remeet [10]. This bound is deduced from a 
complete /-tree with diameter D and is an absolute limit for 


a diametrical structural neighborhood, SNp = $y, , in any 
i=l 

graph of 1 degree and D diameter. Except the complete 

l-ary trees, this bound is rarely*reached. Petersen graph, 

fully connected networks and rings with odd number of 

nodes are interconnections which reach the Moore's bound. 

Therefore, it makes sense to compute for the direct 
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interconnection networks how far is this bound: the farther 
away Moore's bound the structural locality properties are 
worse. This is implemented by the Moore reserve. 

Moore reserves compute exactly only for regular DIN, 
having the same degree [2]. For irregular structures, having 
variable degree, an approximation is to compute Moore 
reserve taking maximum degree. To punctual and structural 
neighborhoods [7, 8] correspond punctual Moore reserve 
and, respectively, proper Moore reserve. Firstly, for the 
punctual Moore reserve, we'll use the number of 
descendents in a Moore tree at the distance d, Namoore = 
=(1 — 1)*~', secondly, for the Moore reserve, we'll use the 
total number of descendents in Moore tree at the distance d, 
Nwoore(a) = Cane c 

Definition 1.1. Punctual Moore reserve at the distance d 
in a DIN, PMR,, is defined by the difference between 
the number of nodes in a corresponding Moore tree at 
the distance d, /(7— 1)*~', with the degree in considered 
network, and the punctual neighborhood in considered 
network: 
PMR, = INeiteore —Nz d=1, seey 1D). 
Definition 1.2. Moore reserve at the distance din a DIN, 
MRz,, is defined by the difference between the Moore 
bound at the distance d and the structural neighborhood: 
MRi=Nijoon(@) — £.N;, d=1y-x D: 
i=l 
Definition 1.3. Diametrical Moore reserve is: 
MRp=Nyoore(D) — SNp . 

To compare various DIN we introduce normalized 
distance, d=d/D, defining normalized punctual Moore 
reserve, PMR;, and normalized Moore reserve, MR5. 

For DIN with same number of nodes, we can formulate a 
topological design/evaluation principle: the normalized 
Moore reserve is bigger, the characteristics of structural 
locality are poorer. The principle is the same for punctual 
and diametrical Moore reserves. 


IJ. EVALUATIONS FOR SOME ORTHOGONAL AND REGULAR 
DIN 


Firstly, we'll give general formulae to evaluate Moore 
reserves for some orthogonal and regular networks: 
generalized hypercubes, hypercubes and hypertori [1, 2]. 
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For generalized hypercubes, GHC, with N= Te: , where 


i=] 
the degree is I= ¥3(m;-1) and Ng = (mj 1m omg = 1) 
i=l 
(number of terms of the sum is ("), number of factors for 


product is d), general formula for Moore reserve 
MRAGHC) is: 
MR, (GHC)= ¥(m-1)x 


i=] 


: d 
[Lim -0-1 =] 
x a > (m; Yom mg -1)-+- 
Sy -De2 


t=] 
For hypercubes, HC, with N=m" where the degree is 
l=r(m — 1) and N=V, |m 1/4, general formula for Moore 


reserve MR,(HC) is: 


MR, (HO)= rin ik —D=IE =! _ Sy, 
r(m—-1)-2 a 


For hypertori, HT, with N=m’, where the degree is /=2r, 
general formula for Moore reserve MR,(H7) is: 


d 
MR(HT)= rr nh 
= 


i=] 


Yn the figure 1 it gives normalized Moore reserves for 
five DIN with approximately the same number of nodes: 
one generalized hypercube, GHC1, three hypercubes, 
HC1+HC3 (HC1 is a binary hypercube), and two hypertori, 
HT1 and HT2 [9]. Let us prove the topological design/ 
evaluation principle given in previous section: for the DIN 
with same number of nodes, the normalized Moore reserve 
is bigger, the characteristics of structural locality are 


LOQE+1 - ees 
1.00EH10 
1.00E+09 


Figure 1. Nonalised IVbore reserve for generalized hypercube GHC1=20%S0, hyperoubes HC1=2!”, 
HO, HO3=10° and hypertori HTS, HI2=10° 


poorer. At the same number of nodes, the interconnection 
networks prove increasing structural locality proportional 
with the degree. So, the order in which the normalized 
Moore reserve diminishes and structural locality increases 
ISH 2s HCL(AT I) HE25 HG8ksand GHEIP The 
corresponding degrees are 6, 10, 15, 27 and 68. 


The degree is a syntetic measure while the Moore reserve 
is an analytical one. While the degree is a more abstract 
measure, the Moore reserve is measured in nodes. More 
intuitively than the degree, it help us to evaluate the 
structural locality, especially in the first part of curves, at 
the small 5 where does not matter the Moore's bound so 
much. For example, at the networks with the degrees 6 
(HT2) and 10 (HC1, HT1), we have the Moore reserve at 
the distance 6=0.2 124 nodes and, respectively, 45 nodes; at 
the distance 6=0.4, Moore reserve for HT2 is 23081 and for 
HEN(ADI); 78iaa(9): 


The degree is not sufficient to measure the structural 
locality. It must to take into consideration the diameter. In 
the tables 1+6 it gives an evaluation of hypercubes and 
hypertori m” with 2, 5 and 10 dimensions and 10, 20, 30, 40 
and 50 nodes per dimension [9]. In our studing case, the 
hypercubes m’ have the diameter r (proportional with the 
number of dimensions) and the hypertori m” have the degree 
2r (proportional with the number of dimensions, also). For 
evaluation of the structures, it used diametrical Moore 
reserve MRp, the degree / multiplied by the diameter D and 
structural average distance ds ([8, 11]. The structural 


average distances are given for measuring functional 
localities, also. 

From these tables, we draw some conclusions. If we 
selected the diameter for measuring structural locality, we 
should not to discern a HC with & dimensions and n, nodes 
per dimension from a HC with the same dimensions and m, 
nodes per dimension. If we selected the degree for 


Table 1. HC with 2 dimensions 

10n/d__|20n/d__—*[30 n/d 
225 [1045 12465 
18x2=36 


8.71x10"" 
245x5= 
=1225 


6.27x10"° 
145x5= 
=i29 


50nd _ 
7.84x10°° 


490x10= 
=4900 


6 18 
oe 
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Table 5. HT with 5 dimensions 
10n/d-[20n/d —*[30n/d ~—s*(40n/d_—*(S0 nd 
0.89x10"* |6.44x10" |4.62x107' |3.32x10” |2.38x10!”” 
10x25= 10x75= 10x100= [10x125= 
=1000 =1250 


en 


Mk» 


Table 6. HT with 10 dimensions 


10 n/d 20n/d_ [30 n/d 40 n/d 50 n/d 


MR, |9.62x10% [833x102 [7.22x10™ [6.25x10™ [5.42x10°° 
IxD |20x50= _[20x100= |20x150= [20x200= _|20x250= 

=1000 =2000 =3000 |=4000 =5000 
a (25 50 [75 100 125 


measuring structural locality, we should not to discern a HT 
with k dimensions and n; nodes per dimension from a HT 
with the dimensions and m, nodes per dimension. If we 
selected the degree divided by the diameter, we should not 
to discern a HC (or a HT) with & dimensions and n, nodes 
per dimension from a HC (or a HT) with s dimensions and 
same n, nodes per dimension. Finally, the degree multiplied 
by the diameter appears to evaluate the structural locality 
much better. Instead of all measurements of structural 
locality based on degree and/or diameter, we introduced 
Moore reserves, as stronger and primer measures. 

In the figure 2 is presented the evaluation of HC based 
on the tables 1+3, and inthe figure 3 is presented the 


OS ——————— ean 
0) 
5 ‘Se nn HC2dim 
2 oo pew | + HC5dim | 

+ 
w [ | —*— HC10dim | 
S 1E+15 _— Sal 
= 1E+12 Baty 
io [eS a ee 
8 1E+06 
2 1000 + ————— CSS 
4 : — 
10n/d 20n/d 30n/d 40n/d 50n/d 
Nodes/Dimension 
Figure 2. An evaluation of HC 
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Figure 3. An evaluation of HT 


evaluation of HT based on the tables 4+6. What we can see 
from these graphs? Firstly, we observe that the hypercubes 


have diametrical Moore reserves less than the hypertori. 
Secondly, we note the diametrical Moore reserve of the 
hypertori grows more rapidly with number of nodes per 
dimension, than those of the hypercubes.: Corroborated, 
both observations leads us to a conclusion: structural 
locality of the hypertori is poorer than that of the 
hypercubes, which was evident, taking into consideration 
the degree and/or diameter. 

In the figure 4 we give together both hypercubes and 
hypertori, based on the tables 1+6. What we observe on this 


1.00E+264 A 
1.00E+252 - 
VOCE DA0) a 
1.00E+228 
1.00E+216 
1.00E+204 
1.00E+192 - 
1.00E+180 - 
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1.00E+144 
1.00E+132 
1.00E+120 +-— 
1.00E+108 
1.00E+96 
1.00E+84 
1.00E+72 

1.00E+60 = igo 
1.00E+48 |— 
1.00E+36 ee 


1.00E+24 ~——- = | 
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Ve 
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#— HT2dim 
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*-- HC2dim 
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@— HC10dim 


Diametrical Moore Reserve 


Figure 4. An evaluation of HC and HT 


figure, from the point of view of the topological design, 
based on the principle enunciated in first part of the article? 
First, because the hypertori grow up fast than the 
hypercubes, the structures HTSdim and HT10dim lose 
properties of structural locality rapidly with nodes per 
dimension. The other structures are more constant with 
nodes per dimension. Second, it is interesting to compare 
the evolution of the structure HT with 2 dimensions, 
HT2dim, with evolution the HC structures. While the HC 
structures quickly saturate becoming near parallel, HT 
structure grows up intersecting the others. What does it 
mean? For 10 nodes/dimension the order of structural 
locality is HC2dim, H72dim, HCSdim and HC10dim and 
the order for 30 nodes/dimension is HC2dim, HCSdim, 
HT2dim and HCl0dim. The order is the order of 
characteristics of structural locality. That means that 
HT2dim has better characteristics of structural locality than 
HCSdim for 10 nodes/dimension comparatively with 30 
nodes/dimension. In spite of the difference of total number 
of nodes! Probably, above 70-80 nodes/dimension HT2dim 
ceased to manifest better characteristics of structural 
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locality than HC10dim. This second observation determines 
us to formulate a conjecture conceming the absolute 
measuring by the Moore reserves: 


Conjecture. The structural locality of a direct 
interconnection network measured by the Moore reserve 
is independent of total number of nodes of the network. 


In the figure 5 we give, comparatively, the evaluation of 
HC and HT networks with structural average distance dg - 


Structural Average Distance 


o : = = + = =-, 
10n/d 20n/d 30nd 40n/d 50néd 


Figure 5. Another evaluation of HC and HT 


Evaluation of these networks from the point of view of 
structural average distance (functional locality) is similar to 
evaluation by diametrical Moore reserve (structural 
locality). It is one difference: while HT2dim has structural 
locality better than HC10dim approximately under 70-80 
nodes per dimension, HT2dim has functional locality better 
than HC10dim only to approximately 20 nodes per 
dimension. HT2dim grows up more rapidly in functional 
locality measured by structural average distance than in 
structural locality measured by Moore reserve. What does it 
mean? Functionally it must be nearer structurally! 
Moreover, as a conclusion, in a good topological design, the 
functional locality it must nearer the structural locality! 


Ill. CONCLUSIONS 


Structural locality together with functional locality 


constitutes the important parameters for topological 
design/evaluation. We introduce Moore reserve as a 


possible absolute measure to interconnection structural 


locality of the direct interconnection networks. As an 
altemative to the degree, to the diameter and to them 
product or quotient, it measures siructural locality in a way 
stronger, absolute. Moore reserve is measured in nodes and, 
therefore, is more intuitively, more precisely and more 
primary than the degree and/or the diameter. 
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Intelligent Networks 


An algorithm for the efficient utilization of 
Service Control Capabilities in Intelligent 
Networks 


V. M. Stathopoulos, S. E. Polykalas, and I. S. Venieris* 


Abstract - The Service Control Point (SCP) sustains a key role 
in Intelligent Networks (IN) [1], [2] since it contains the 
software components for processing service requests. The 
increased demand of existing IN services by the users and 
the introduction of new more complex services in terms of 
processing requirements and messages exchanged among the 
IN nodes, overloads the IN and particularly the SCP. In this 
paper, an algorithm for the dynamic reservation of SCP 
resources is presented and evaluated. This algorithm uses a 
number of factors related with the profile requirements of 
the particular IN service; e.g. some services are very popular 
among users, others are categorized as expensive services due 
to their complexity, e.t.c.. The main scope of the proposed 
algorithm is to maximize the utilization of the SCP processor 
capacity, that is, to improve system scalability. The proposed 
algorithm allows the normal operation of the SCP under peak 
rates of requests for different kind of IN services. 
Furthermore it determines priorities on each IN service 
allowing the selective privileged handling of the requests. A 
study of the efficiency and adaptability of this algorithm over 
the SCP is performed by simulation. 


Index terms—Intelligent Network (IN), Service Control Point 
(SCP), Integrated Services Digita] Network (ISDN), Virtual 
Private Network (VPN) , Signaling System No. 7 (SS7). 


I. INTRODUCTION 


Congestion can occur in the Intelligent Network (IN) when 
the resource demands, to an IN node, exceed its capacity. 
The most affected node is the SCP (Service Control Point) 
because it contains the SCF (Service Control Function), 
namely, the service logic that controls the IN-based 
services. The result of congestion is that packets are lost 
due to extensive queuing in the SCP and this can be 
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expanded throughout the network. The IN network 
throughput degrades and may drop to zero. 


There is a large variety of services that nowadays users are 
willing to use. These services address different profile 
requirements concerning the resource demands that impose 
to the IN network. Hence, services can be categorized into 
different classes. Services that belong to the same class 
demand the same resources from the SCP. A service 
request is accepted only when available resources exist. 
Therefore one category may monopolize the IN resources, 
a situation that leads to undesirable disproportion among 
services. 

To overcome this obstacle, an algorithm is proposed, that 
is based on the dynamic reservation of SCP resources for 
every individual service. Its application in the SCP is 
investigated and the achieved improvements are addressed. 
This paper is organized as follows. In section II three 
methods for handling the resources of the SCP are 
introduced. The third method is based on the application of 
the algorithm in the SCP. In section III we give the details 
of the algorithm. Section IV contains a comparative 
performance evaluation study. Finally, in section V the 
conclusions are summarized. 


II. METHODS OF SCP RESOURCE ALLOCATION 


Services from one class are considered simple from the 
network view and also popular among users. The demand 
for these services is higher compared to services belonging 
to others classes. A second service class may be addressed 
to contain more complex services from the network view 
and so they are signed as more “expensive” from the 
service providers, than others. Obviously, each service 
Class impose its individual traffic load to the IN network 
and especially its individual resource demands to the SCP. 
A way to control these individualities is to apply a set of 
service priorities. SCP resources could be allocated to the 
call requests in a way that satisfies the call traffic loads of 
different classes in a fair base. In this paper, three methods 
will be analyzed. The first one, named as “unlimited”, can 
service every request, if and only if, the SCP has available 
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resources. The second method, named as “exclusive”, 
exclusively allocates SCP resources beforehand for every 
different service class. Hence, a threshold is defined that 
poses an upper limit for the maximum rate of requests that 
will be served by every service class. This method 
guarantees the offering of resources to every service class 
but does not solve the problem of handling fairly the 
asymmetric traffic. Calls asking for a specific service may 
be discarded if the dedicated, to this service class, 
resources has drawn, even if free resources exist in the 
system allocated in the buckets of other service classes. The 
third method, named as “algorithmic” considers a dynamic 
way of allocating the resources to the services. The 
advantages of the second method are basically supported 
and additionally the algorithm can authorize a service to 
“borrow” a percentage of resources that initially are 
allocated to another service class. 

Three different IN services named as service A, B and C 
are assumed that belong to individual service classes. 
Service A is the most popular one among users (e.g. 
televoting ), while service C is an expensive service (e.g. 
VPN ) among others. Service B is the most resource 
consuming one for the SCP because of its complexity. 
Every service exactly map with every service class. 


Ill. DESCRIPTION OF THE DYNAMIC APPROACH 


The operations of the algorithm applied within the SCP are 
described in the following. A mean percentage of the SCP 
resources is dedicated for every IN service class in 
advance. The allocation of SCP resources, to every service, 
is based on service traffic prediction. Thresholds tha, thb 
and thc, determine the upper limit for the dedicated 
percentages of the SCP resources for the processing of A, 
B, and C services respectively. According to the current 
traffic profile, the percentage of resources assigned to every 
individual service, can deviate from the mean rate. For 
example, service A initially allocated resources up to tha 
while service B uses less resources than the allocated 
determined up to thb. If the arrival rate of calls for service 
A increases further, then the remaining free percentage of 
resources until thb, is gradually assigned to service A (step 
by step). Therefore, the allocation of resources for service 
A and B is described by the equation: cur_tha = tha + step 
and cur _thb = thb - step (Figure 1) respectively. This 
process continues until an upper bound (/im_th), 
concerning the resources of service B, is reached, where 
beyond this, no extra “borrowing” can be negotiated. 
Parameter /im_th secures a minimum percentage of 
resources dedicated to every service from the total SCP 
resources. Finally, if the traffic rate of calls for service B 
increases further, it will gradually take back its percentage 
of resources. 


Pa Se Seu Fi Ot eens 
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Figure 1 Parameters of the algorithm 


The algorithm uses priorities for servicing IN requests. If 
service A needs to “borrow” a percentage of resources from 
the bucket dedicated to another service, the algorithm 
specifies the service that should offer it. A service with 
lower priority that currently demands less resources, than 
initially allocated to it, determined by the threshold 
(set_th), is the proper candidate. The value of this 
threshold is smaller than the initial value (th). 

The algorithm has the ability to gradually “lend” a 
percentage of resources from the bucket of a service with 
lower priority, to a service with higher priority, even if the 
lower priority service uses the total of its allocated 
resources. When, a minimum limit (u/im_th) is reached 
“lending” stops. This limit is higher that /im_th. 


The parameters used are defined below: 

th : the initial percentage of the assigned SCP resources to 
each service. This is determined by predicting the needs of 
the expected traffic load for every service. 

cur_th : the currently used resources from each service. 
set _th : the threshold of promptness which defines if a 
service sufficiently uses the dedicated to it percentage of 
resources. 

step : the percentage of resources that is newly assigned on 
demand to each service. 

lim_th : the possibly lower percentage of dedicated 
resources that has to be sustained by each service 

ulim_th: the minimum percentage of resources that are 
dedicated for every service when the offered traffic load 
asking for this service remains at the level of the initial 
prediction 

prior : the priority of every IN service 
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The SCP aigonten is stadied throazh sumaisnon ume 
PARSEC 3]. The simmaiation model comtems these coll 
generators. Every ome gomoraics trafic wath Gificront mem 
teafiic rates for cwery mndiwadinal IN servace. The mom rie 
of calls follows Poisson Gistribation [7]. [8]. The SCP 
pikysical eatity is modeled by a FIFO gucee [5]. The SCP 
factions of the SS7 protocol stack [5], J6]. 


BR Simadation Resuks - Comparison of Methods 


Tk ome scesamns, the SCP accopis requests for three 

call of am IN sernice aad the estimations of the expected 
traffic load fhe every serwace Miestraied at Table 1. The 
total expected traiiic lnad regeires 30% of the mene wnitel 
SCP resousces [5]. The resi 20% 3s dedicated for the 
here for the IN services are quite Close ip reality 5]. 


Tabie 1 Process times and traffic Inad disoibation 


The first scemamte assumes mesther the application of the 
algontie mor the pee-allocation of resoarces % any of the 
IN sermces (Method 1). The symbol 05 2 1 dGeckees that 
the offered traffic load for services A_B and C is cgmal 
30%, 200% aad 100% of the respectively expecied traffic 
leads for these services. Figeres 2. 3 and 4 Ghestreaic the 
Offered traffic load the accepted calls and the maximuem 
expected traffic Imad m callis/scc versus time. for the three 
services. The wahae of the munxizemm expected traffic load 
for cvery sommes, depends om the time the SCP consumes 
Process cvery Call. For semace A whore the poroomias: af 
the dedicated resommoss is af the level of 35%, it is assemed 
that the process time for every call of @ is Imsec. The 
mas expected traffic load eguals t 0.35*1000 = 350 
calis'sec. The Gismnibation of the offered traffic load versus 
time. is shown at Table 2. 


CS eS ee ae 
| 205 05 05 [11 05 | 2105] 211 


Tatie 2 Traffic load distribation 


Figure 2 Methed 1. Offered -Accogied-Mindamam 
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Figure 3_ Method 1 Offered -Acospied-Winmn 


 Galliseo 
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Figure +_ Method 1. Offered -Accepied-Mingeen 
expected Trafic versus simmaiztiom tise for sence C 


Resales show that calls are discarded for all sonaccs. Even 
thowg the offered traffic load of calls for senvaces B and C 
Temgaas low, 2 remarkable percomtage is rejected Ths is 
becuase calls asking for sence A ese most of the SCP 
resowEces. 

allocation of resources for every indiwadinall sorvice class 
(Method 2) Table 1 shows the allocated percomtazes of 
resoumces. Figures 5. 6 amd 7 tihestrate the offered traffic 
load the accepted calls and the marci expected traffic 
load wersus time. for the these services. 


simulation time moments 


Figure 5 Method 2. Offered -Accepted-Maximum 
expected Traffic versus simulation time for service A 


c—7 Offered 
== Accepted 
eos lax EXP CAP 


Exclusive 
Service B 


Figure 6: Method 2. Offered -Accepted-Maximum 
expected Traffic versus simulation time for service B 


Exclusive 
Service C 


Figure 7: Method 2. Offered -Accepted-Maximum 
expected Traffic versus simulation time for service C 


Results show that calls asking for a specific service are 
accepted if their traffic load does not exceed the initially 
maximum expected traffic load for this specific service. 
When this restriction is not satisfied by a specific service, 
calls are discarded even if the SCP has spare resources. 
Hence, even with this method we do not exploit the 
maximum offering of SCP resources. 

Finally, the algorithm of section III is applied at the SCP 
with the same traffic parameters. The algorithm 
parameters are shown at Table 3. Figures 8, 9 and 10 
illustrate the offered traffic load, the accepted calls and the 
maximum expected traffic load versus time, for the three 
IN services. 
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Algorithm 
Service A 


t= Accepted 
Max exp cap 


2 3 
simulation time moments 


Figure 8 Method 3. Offered -Accepted-Maximum 
expected Traffic versus simulation time for service A 


2 3 4 
simulation time moments 


Figure 9 Method 3. Offered -Accepted-Maximum 
expected Traffic versus simulation time for service B 
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Figure 10 Method 3. Offered -Accepted-Maximum 
expected Traffic versus simulation time for service C 


Until time t2 all calls are accepted because there are 
available SCP resources. After t2 the traffic load of service 
A exceeds the maximum expected traffic load and at t3 it 
becomes double of what was expected. Until t3 the offered 
traffic load for service C remains at 50% of the maximum 
expected traffic load. Hence, there are some spare 
resources that normally belong to service C bucket but 
these are not in use. The algorithm authorizes the SCP to 
offer these resources to service A. The traffic rate of calls 
that ask for service C increases for the time period t3 - t4. 
Hence, service C, gradually takes back the resources, 
offered to service A and all calls that belong to service C 
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are accepted In parallel. the calls for service A that exceed 
the maximum expected traffic rate are discarded. The 
algorithm authorize the SCP to accept some more calls for 
service A than those expected. Parameter “step” affects the 
speed of adaptation of the algonthm. Figure 11 shows 
results for three different values of “step”. The vertical 
axis illustrates the percentage of calls that were accepted. 
while in the horizontal axis the different values of “step™ 
are shown. The traffic load that is assumed is 
@1_1 4 The value of “step” determines the speed with 
which service C can “borrow” resources from the rest of 
the services. For example when “step” equals 1. the 
percentage of the discarded calls is 60%, because service C 
borrows resources with a low order (1% for every step). In 
contrast. when step equals 10_ the percentage of discarded 
calls reduced to 44%. The choice of value for the “step~ 
parameter. is determined by the service provider using as 2 
criterion “how important is to serve the high priority 


Step 
Scenano 1_1_4 


Sent Sens septo 


Figure 11 Speed of adaptation 


V_ CONCLUSIONS 


In the centralized architecture of the IN_ the SCP is 
responsible for serving all IN caiis. This may lead to an 
overloaded SCP. To overcome congestion situations and 
rejection of calls. in this paper. we have proposed an 
algorithm that guarantees fairness among services and 
high exploitation of SCP resources. The advantages of our 
approach have been highlighted through simulation and 
comparison with the “unlimited” and “exclusive” methods 
of allocating SCP resources. The “unlimited” method 
shown to present low performance under asymmetric loads 
of traffic. The “exclusive” method pre-allocates resources 
to every service class. and so protects them when a service 
tries t0 monopolize the available resources. However. even 
this method does not fully exploit the resources of the 
network. The proposed algorithm authorizes the SCP. to be 
dynamically adjusted to the offered traffic load needs. This 
Way, Services are protected from the peak loads of the other 
services. In addition. setting priorities to the service 
classes. gives the opportunity of privileged management of 
the SCP resources and quality of service assurance. 
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A Fault-Tolerant Multicast Routing Algorithm 
Based on Cube Algebra for Hypercube 
Multicomputers 


Salih GUNES, Student Member, IEEE', Nihat YILMAZ! and Ali OZTURK? 


Abstract—In this study a broadcast routing algorithm has been 
developed for faulty hypercube parallel processing system 
using cube algebra. Without any restriction to the number of 
the faulty nodes, the routing from the source node to the 
destination node is implemented minimally. The developed 
routing algorithm has been visually simulated via prepared 
data routing simulator program. It has been observed that this 
algorithm can be applied to various routing problems. 


Index Terms -- Fault tolerance, multicast routing, cube 
algebra, subcube allocation. 


I. INTRODUCTON 


A n-dimensional (n-cube) hypercube has 2” nodes. The 
nodes are numbered from 0 to 2”-/ and the neighbouring 
nodes varies from each other by only a single bit. A 
hypercube can be seperated into small subcubes [1]. If the 
hypercube has faulty nodes or links, then it is being called 
as faulty hypercube. Lots of studies in various subjects has 
been done in hypercube parallel processing systems. 


Subcube allocation, pattern enhancement and various types 
of routing algorithms are only a few of them. The routing 
algorithms can be investigated in two different parts 
according to the minimality of the data route. These can 
also be discussed in three sections namely node-to-node, 
node-to-multiple nodes (multicast) and node-to-all nodes 
(broadcast). These have great importance especially for 
minimal routing algorithms [2]. In this subject various 
routing algorithms are presented by lots of researchers [3, 4, 
5]. The aim in the multicast routing algorithm is to find the 
data route from the source node to destination nodes in 
minimal steps. 


We investigate fault-tolerant multicast routing based on 
cube algebra which aims at finding feasible minimum paths 
in a faulty hypercube.In this study an approach to determine 
the shortest path between the source and the destination 
nodes in faulty hypercube is proposed. In the second part of 
this paper, the cube algebra operations used in the 
developed routing algorithm and the general features of the 
coordinate multiplication operation are discussed. The 
method followed in the algorithm is explained in the third 

part and a sample simulator image which contains three 

destination nodes has been given. In the fourth section, the 
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developed routing algorithm has been introduced. The last 
part and a sample simulator image which contains three 
destination nodes has been given. In the fourth section, the 
developed routing algorithm has been introduced. The last 
section is dedicated for the discussions of comparisons 
between the broadcast routing algorithms developed 
beforehand and the implemented algorithm in this study. 


II. CUBE ALGEBRA 


The minimum terms in logic algebra constructs the 
foundation of cube algebra. However, there can be at least 
three variable in cube algebra. Three variables determine a 
cube. With cube algebra, a point, a line by two points, a 
plane by four lines and a cube by six planes can 
geometrically be defined. 


Cube algebra was initially developed and applied for 
determining the last situation of the switching functions. 
Afterwards it has been used to find the main components of 
the switching functions [6]. Moreover, to determine all the 
cubes in a faulty hypercube [7], to simplify the logical 
functions [8], in non-minimal routing algorithm [9] and in 
the unicast routing algorithm [10] the cube algebra is used. 
In the developed algorithm of this study the coordinate 
subtraction and multiplication operations of the cube 
algebra are used together. 


The coordinate subtraction and multiplication operations 
are applied between two cubes which have the same 
dimensions. But the subcubes to which the subtraction or 
multiplication operation will be applied can be in two 
different values if they have the same dimensions. The 
coordinate multiplication and subtraction operations are 
implemented in two steps. In the first step a multiplication 
or subtraction vector (VP and VS) is formed to determine a 
v; component. In the next step, the coordinate multiplication 
and subtraction operations are applied to the cubes A and B 
according to coordinate values of vectors VP and VS, 
which are determined in the previous step. 


In the following the procedure related to the coordinate 
multiplication operation is given. 


Let A and B be two cubes which have the same dimensions. 
A = 4], Q,..., Gj, ..., Gy and 
Be Dp DD. .2., Dj.es.Dn 
To apply the coordinate multiplication operation between 
these cubes, the multiplication vector VP is formed with the 
following operations to determine the v; component. 
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VP= v;vp..v;..v, and i €{1, 2, ..., m} in the vy 
component. 
If aj = bj then v; = aj = 5j 


If aj = * and bj # * then v; = 5j. 

If aj #* and 5; = * then v, = qj. 

If aj, bj €{0, 1} and aj = bj then v; =y. 
According to the coordinate values of the VP vector the 
coordinate multiplication results of the A and B cubes are as 
in the following; 

a)lf any v,; = y can’t be found, the result of the 
multiplication of A and B is the subeube VP which is the 
subcube of A and B (That is, A B = VP). 

b) If there can be found v, = y for only one i, then 
for the others if v; = a, = 5; then (here, j €//, 2, .... -1, i+], 
..., 2}). The result of the multiplication of the cubes A and B 
is the cube C which is found by replacing the symbol * for 
y, in the vector VP (That is, A *F B=C). 

c) If only a v, = y exists and for a, = *or 5; = *, 
v; # * then the result of the multiplication of the cubes A 
and B is the cube C which is found by replacing the symbol 
* for v, in the vector VP. Some part of the cube C is covered 
A and the other part of it is covered by B. This shows that 
cube C is related with A and B. 

d) if at least two v; and v; exist then v=v;=y. In this 
case, C which is the result of the coordinate multiplication 
of A and B is ©. This means that there is not a direct 
relation between the cubes A and B. 


Ill. THE METHOD FOLLOWED IN THE 
ALGORITHM 


In multicast routing method, the first step is to find all the 
subcubes remaining after extracting the failing nodes set 
from the full cube by means of coordinate subtraction 
operations. In the second step, the subcube which includes 
source node and the destination nodes is established from 
these subcubes. Then the subcube that includes the source 
node and the subcubes that include the other destination 
nodes are multiplied with the coordinate multiplication 
method. If a node or a link is found after this multiplication 
it is assumed that a message path exists between the source 
and the destination. The same method is applied recursively 
until the transition subcubes between the source and the 
destinations are found. 


After finding the subcubes between the source and the 
destinations, it is possible to reach the expected destinations 
in minimal routes by means of the developed destination 
routing algorithm. 


By this method it is implemented that the cube algebra 
method is applicable to the minimal multicast routing 
algorithm. Moreover, the superiority of this method on the 
previously developed algorithms is that it doesn’t put a 
restriction to the number of the failing nodes. In the 
following figure, the message route for the three destination 


node on the hypercube simulator which is developed to 
show : : 


F={10011,01000,01110,11000,10101,10000,00111,01010, 
11101,10000,10001} failing nodes set, A=00000 source 


node and B= {11111,11110,01001} destination nodes set 
can be seen. 


The simulator images related to this example is shows in 
Figure 1. 


ws 
‘A eeraa) 


@ Faulty Node 
Nonfaulty Node 
Figure 1. The Simulator for 5-dimensional Hypercube 


The data route found for the node 01001. 
00000-00001-01001 

The data route found for the node 11110. 
00000-00100 10100-11110 

The data route found for the node 11111 is 
00000000 10-00110-10110-1011111111 


To pay some attention, the data route from each of the three 
source nodes to the destination node is implemented in the 
existent minimal steps. 


IV. THE IMPLEMENTED ALGORITHM 
procedure TAnaProg.hyperClick(Sender: TObject); 


nbuyukhedefmes:=10000; 

neskiuzunluk:= 1000;nwaysay:=0;nekislemsay:=0; 

nhedefsay:=count of targets; 

for iz=1 to count of targets do begin 

hedefbulundu[iz]:=false; 

minmesvarmi[iz]:=false; 
hamdistance[iz]:=0; 
nhedefad[iz]:=memol .Lines.Strings[iz-1]; 
nhedefmesafe[iz]:=10000; 

end; 


GAS ee eee 
108 10” Mediterranean Electrotechnical Conference, MEleCon 2000, Vol. I 


for i:=1 to count of target do hamdistance[i]:= minimal 
distances between source and targets; 
cikart; 
ncikartsonuc:=sonuclar; 
ncikartsonucmax:=sonucmax; 
cikarmasonucunuboyutagoresirala; 
hatalinodlartayir; 
if nwaysay=1 then 
hedefvarmi(kaynaknodeno,nwayskup|1,1,1]); 
neskiluzunluk:=10000; 
net2: 
wayeklendimi:=false; 
for iz:=1 to nwaysay do begin 
‘if nwaysdurum|[iz|=false then continue; 
nekwaysay:=0; 
eklendimi:=false; 
for j:=1 to ncikartsonucmax do begin 
hata 1 :=false; 
for k:=1 to nwayskupsay[iz] do if 
ncikartsonuc[j]=nwayskup[iz,k, |] then hata1:=true; 
if hatal=true then continue; 
ncarpsonuc:=carpma(nwayskup[iz,nwayskupsay[iz], | },ncik 
artsonuc[j]); 
if ncarpsonuc='Bos kiime' then continue; 
if eklendimi=false then begin 
eklendimi:=true; 
wayeklendimi:=true; 
nwayskup[iz,nwayskupsay[iz]+1, 1 ]:=ncikartsonuc[j]; 
nwayskup[iz,nwayskupsay[iz]+1,2]:=ncarpsonuc; 
minmesvarmi| 1 ]:=false; 
for m:=1 to nnodesay do 
if ncarpsonuconnodead{m] and ncikartsonucD>nnodead[m] 
then begin 
for s:=1 to nnodesay do begin 
if (nnodedurum[s]=false) then continue; 
if nwayskup[iz,nwayskupsay[iz], | ]>nnodead[s] and 
nwayskup[iz,nwayskupsay|iz],2 }>nnodead[s] then begin 
nuzunluk:=arasindakiuzunluk(nnodead[m],nnodead[s]); 
if nuzuniuk<nnodemesafe[m] then 
begin 
nnodekayno[m}:=s; 
nnodekupad[m]:=nwayskup[iz,nwayskupsay[iz], |]; 
nnodemesafe[m]:=nuzunluk; 
nnodedurum[m ]:=true; 
if nnodemesafe[m ]>=nbuyukhedefmes then 
nnodedurum[m]:=false; 
end; 
hedefvarmi(m,ncikartsonuc[j]}); 
end://if 
end; // for s 
end; //for m 
if minmesvarmi[1]=false then nwaysdurum[iz]:=false; 
end else begin 
nekwaysay:=nekwaysay+ 1; 
nwayskupsay[nwaysay+nekwaysay ]:=nwayskupsay[iz]}+ 1; 
for s:=1 to nwayskupsay[iz] do begin 
nwayskup[nwaysay+nekwaysay,s, | ]:=nwayskup|iz,s, 1]; 
nwayskup[nwaysay+nekwaysay,s,2]:=nwayskup|iz,s,2]; 
end; //for s 
nwayskup[nwaysay+nekwaysay,nwayskupsay[iz]+1,1]:=ne 
ikartsonuc{[j]; 
nwayskup[nwaysay+nekwaysay,nwayskupsay[iz]+1,2]:=ne 
arpsonuc; 


nwaysdurum|[nwaysay+nekwaysay]:=true; 
minmesvarmi| 1 ]:=false; 
for m:=1 to nnodesay do begin 
f ncarpsonuconnodead[m] and ncikartsonuc>nnodead[m] 
then begin 
if hata3=true then continue; 
for s:=1 to nnodesay do begin 
if (nnodedurum{[s]=false) then continue; 
hata3:=false; 
naradeg5:=nwayskup[iz,nwayskupsay[iz], |]; 
naradeg3:=nwayskup[nwaysaytnekwaysay,nwayskupsay|i 
z},2] 
if nwayskup[iz,nwayskupsay|iz], 1 ]>nnodead[s] and 
wayskup[nwaysay+nekwaysay,nwayskupsay|iz],2 ]>nnode 
ad[s] then begin 
nuzunluk:=arasindakiuzunluk(nnodead[m ],nnodead[s}); 
if nuzunluk<nnodemesafe[m] then 
begin 
nnodekayno[m]:=s; 
nnodekupad[m]:=nwayskup[iz,nwayskupsay|iz], | ]; 
nnodemesafe[m ]:=nuzunluk; 
nnodedurum[m]:=true; 
if nnodemesafe[m ]>nbuyukhedefmes then 
nnodedurum[m]:=false; 
end; 
hedefvarmi(m,ncikartsonuc[}j]); 
end;//if 
end; // for s 
end; //for m 
if minmesvarmi| 1 ]=false then 
nwaysdurum|[nwaysay+nekwaysay]:=false; 
end; //if 
end; //for j 
if eklendimi=true then 
nwayskupsay[iz]:=nwayskupsay|[iz]+1; 
end; //for iz 
nwaysay:=nwaysay+nekwaysay; 
if wayeklendimi=true then goto net2; 
end; 
end; //procedure 


V. RESULTS AND DISCUSSIONS 


As a result, in the developed multicast routing algorithm a 
minimal route has been warranted without giving any 
restrictions to number of the faulty nodes. Moreover, it has 
been proven formally that the cube algebra which had been 
used in various applications formerly can be applied to the 
routing algorithms for a faulty hypercube. Based on the 
discussions above an algebraic method is proposed and an 
algorithm composed of its base for determination the paths 
from a source node to any destination node proposed. This 
algorithm allows to find all the paths between these nodes 
and is non dependent to number and configuration of faulty 
nodes and links. The work of the algorithm may be finished 
after determining the first path lenght of which is equal to 
Hamming distance. The algorithm is effective in a sense of 
fault tolerance for the hypercube system which has a great 
number of faulty nodes and/or links. This approach may be 
applied to the routing algorithm with local information 
about faulty nodes and links to broadcasting and 
multicasting algorithms and task allocation problems. 
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Figure 2. The variation of the number of the faulty nodes 
with respect to the number of operations. 


In figure 2, the variation of the number of the faulty nodes 
with respect to the total number of operations can be seen 
for 12 destination nodes in a 6-dimension hypercube. The 
variation of the total number of operations with respect to 
the destination node number can be seen in figure 3. This 
time, the number of the faulty node number remains the 
same . As can be seen from figure 2, the number of 
operations doesn’t increase after a specific number of fault 
nodes. Besides, it tends to decrease. This is very important 
for the efficiency of the algorithm. The total number of the 
operations increases according to the number of the 
destination nodes. This can be seen in figure 3. 


Number of Operation 


oOo 0 


Nuwmber of Faulty Nedes 


Figure 3. The variation of the number of the faulty nodes 
with respect to the number of the destination nodes. 
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Intelligent Tutoring Systems for Asynchronous 
Distance Education 


Marko Rosié¢, Slavomir Stankov, and Vlado Glavini¢, Member, IEEE 


Abstract — Asynchronous distance education delivery systems 
do not require real-time student-human teacher interaction 
thus enabling students to use tutoring resources anytime and 
anywhere. Among various possibilities for implementing 
asynchronous distance education delivery computer supported 
ones are nowadays the most popular. Categories thereof are 
intelligent tutoring systems that are used for supporting and 
improving the process of learning and teaching in arbitrary 
domain knowledge. In such education models the role of 
human teacher is replaced with a computer tutor basically 
consisting of a domain knowledge base as well as of both a 
student module with an appropriate student model and a 
teacher strategy guiding the learning and teaching process. 
The paper describes an authoring shell — Distributed Tutor- 
Expert System (DTEx-Sys) — developed for asynchronous 
distance education purposes, as a secondary knowledge source 
for teachers and their students in secondary and primary 
schools. 


Index Terms — asynchronous distance education, authoring 
shells, intelligent tutoring systems. 


I. INTRODUCTION 


The growing popularity and ease of access to the World 
Wide Web (WWW) stimulates present day interest in 
systems for self-learning in general and distance learning 
(DL) in particular. The Web provides itself as an 
outstanding delivery medium, thus acting as both a provider 
of content and of subject matter. On the other hand HTML 
technology allows and easy way to deliver all kinds of text, 
graphics, sound and video offer. 


It is claimed that the Web represents a well-chosen 
technology for implementing asynchronous distant 
education delivery systems. Such systems do not require 
real-time student/human teacher interaction hence enabling 
students to use tutoring resources anytime and anywhere. 
Moreover, students within asynchronous group learning 
environments do not need to manage their schedules. 
Learning networks based on asynchronous communication 
offer unique opportunities for active participation. Unlike 
the traditional classroom, students within on-line courses 
have access to the airtime they want or need, enabling every 
one of them to have a voice. Asynchronicity provides each 
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student the time to reflect, formulate ideas and compose 
responses thoughtfully, hence enabling the elevation of the 
quality of student interaction and participation. 


Among’ various’ possibilities for implementing 
asynchronous distance education delivery computer 
supported ones are nowadays the most popular. A category 
thereof are intelligent tutoring systems (ITS) that are used 
for supporting and improving the process of learning and 
teaching in arbitrary domain knowledge. In such education 
models the role of human teacher is replaced with a 
computer tutor, basically consisting of a domain knowledge 
base as well as of both a student module with an appropriate 
student model and a teacher strategy guiding the learning 
and teaching process [1]. An ITS takes into consideration 
the knowledge about what to teach (subject matter — 
domain knowledge), the way to teach (pedagogical strategy 
— teacher), as well as the relevant information about the 
student being taught (student). As shown in Figure 1, there 
are four main interconnected modules of an intelligent 
tutoring system: (i) domain knowledge module with the 
domain knowledge base, (ii) teacher module for guiding the 
teaching and learning process, (iii) student module with 
information that is specific to each individual student, and a 
(iv) suitable user interface module enabling interaction 
among student, teacher and domain knowledge [1,2]. 


User interface 


module 
Student 
module 
Domain knowledge 
module 


Figure 1. Typical ITS Architecture 


Teacher 
module 


Teaching and learning are expressed by a four-phase 
functional cycle: (1) didactics, (2) perception, (3) diagnostic 
and evaluation and (4) help and remediation [3]. This 
process is a self-adaptable one and is performed according 
to the assumed pre-knowledge, capabilities and students’ 
behavior. 
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The biggest problem in building an ITS is its high costs 
resulting from the inherently cyclic and time consuming 
design process. In order to overcome this problem another 
approach has been chosen, namely to create particular ITSs 
from flexible shells acting as program generators. Such 
authoring shells should show design usability and 
flexibility so as (i) to allow different representations of 
problem areas, and (ii) to enforce ease-of-use when 
developing an ITS for a particular problem area [4]. 


The asynchronous nature of ITSs makes them a good 
candidate for use throughout the Web, since they facilitate 
instruction offer anywhere and to anyone and for an 
arbitrary domain. 


In this paper we present a Web-oriented authoring shell — 
Distributed Tutor-Expert System (DTEx-Sys), which is 
developed for asynchronous distance education purposes. 
DTEx-Sys is designed as a 3-tier client-server architecture 
[5] where the intelligent tutoring functions are separated 
from the user interface and the knowledge base(s). The 
system functionality comprehends knowledge base access 
for arbitrary domain knowledge, along with testing, 
diagnosing and evaluation of students’ work. TEx-Sys 
features Web-based tutoring by means of standard browsers 
and an interactive hypermedia learning environment, and 
enables a particularly easy two-way communication 
between students and teachers, as well as students and 
students. 


II. WEB ORIENTED INTELLIGENT TUTORING SYSTEMS 


The Web has demonstrated itself as one of the most popular 
education supporting technologies on the Internet. 
Thousands of Web-based courses have been made available 
within the last five years already, unfortunately most of 
them using static hypertext pages [6]. In contrast, there are 
presently only few technoiogies for implementing ITSs on 
the Web. Frequently used technologies are those enabling 
dynamic generation of Web document contents, here 
including the following ones: Common Gateway Interface 
(CGI), Active Server Pages (ASP) and Java Server Pages 
(JSP). ASP and JSP are better solutions than CGI, because 
they are faster and support interoperability with components 
(Component Object Model or JavaBeans components). 
Another group of technologies for developing distributed 
ITSs are those based on distributed objects such as 
Common Object Request Broker Architecture (CORBA), 
Distributed Component Object Model (DCOM) and Java 
Remote Method Invocation (Java RMI). Additionally, for 
operating the clients’ side, Java applets and Active X 
components are also used. 


A. Web-Based Authoring Shells 


A number of Web-based authoring shells — six of them — 
are considered in the following, with the purpose of 


illustrating design decisions and implementation 
technologies. 


ELM-ART (ELM Adaptive Remote Tutor) is a Web based 
ITS to support the learning process of LISP programming. 
It is based on the on-site ELM-PE (ELM-Programming 
Environment) intelligent learning environment used to 
teach an introductory LISP course at the University of Trier 
[7]. The course materials in ELM-ART are presented as an 
on-line intelligent textbook with a hypermedia integrated 
problem solving environment. The system is implemented 
using Common Lisp Hypermedia Server functions. 


CALAT (Computer Aided Learning and Authoring 
environment for Tele-education) consists of an ITS shell on 
the Web server side and of a multimedia scene viewer on 
the client side. This system is built with the aim of 
achieving a high degree of adaptation to individual learners 
by dynamically selecting the subject matter. Besides, it 
includes a courseware authoring system that can be readily 
used by courseware specialists even without system or 
programming skills [8,9]. CALAT uses CGI in combination 
with appropriate plug-ins. 


Lee and Wang [10] proposed an intelligent hypermedia 
learning system that can perform intelligent as well as 
individual instruction based on the combination of CORBA 
and the Web. It provides instructional units for diagnosing 
students’ response and allows dynamic instruction 
strategies developed according to a student’s progress. 


Another system with a Web oriented intelligent learning 
environment is VALIENT (Virtual Adaptive Learning 
Integrated Environments using Net-based Technologies) 
[11]. Its authors aim to combine the strengths of the Web as 
a mechanism for delivering conventional instructional 
material with the learning opportunities available in 
intelligent learning environments. VALIENT uses JSP 
technologies and Java applets. 


The IDLE system (Intelligent Distance Learning 
Environment) supports the development of adaptive Web 
courseware in the areas of artificial intelligence (especially 
knowledge engineering) and cognitive sciences 
(psycholinguistics and neuro-psychology). The 
implemented prototype is a multi-agent portable system 
enforcing access restriction and using multimedia effects 
[12]. There are both a UNIX and a PC version, the UNIX 
one using CGI, while the PC one using JSP technology. 


The MANIC (Multimedia Asynchronous Networked 
Individualized Courseware) [13] project is targeted towards 
facilitating individualized asynchronous distance learning 
based on new technologies like high-speed computer 
networks and hypermedia instructional material (text, 
graphics, animations) presentation. Its authors concentrate 
the research especially on the effective use of Web 
hypermedia technology for delivering to students stored 
instructional courses, such as class notes/overheads and 
audio/video of classroom lectures. MANIC is implemented 
with CGI, using Real Audio and Real Video Server. 
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Ill. DISTRIBUTED TUTOR-EXPERT SYSTEM (DTEX-SYS) 


Our DTEx-Sys is built by keeping in mind universality and 
application quality thus resulting in generality of use. A 
brief consideration of DTEx-Sys universality features 
shows (i) the support for availability to a wide group of 
users and (ii) the possibility of its use for arbitrary domain 
knowledge. A coarse-grained decomposition of DTEx-Sys 
universality results in the following elements: 


° user time independence when accessing system 


services, 

user location independence when using system 

services, 

independence of client computer system platform, 

limited requirements upon client computer system, and 

° uniform way of storing knowledge regardless of the 
subject matter in question. 


Regarding application quality the following elements are 
considered: 


° 


the system is adaptable to individual users’ needs, 
learning is performed within a hypermedia 
environment, 

the system supports a teacher consultation option, 

it offers a user supervision option, 

it a incorporates a user test feature, 

it provides tested users with recommendations for 
future work, 

the system is compatible with resources pertaining to 
other distance learning systems, and 

it is suitable for future refinement. 


° 


° 


The system has been designed according to the general 
principles of student knowledge and skills acquisition, as 
well as of the methods and techniques of knowledge 
representation. Of course, DTEx-Sys builds upon the 
experiences of an earlier on-site version denoted TEx-Sys 
(Tutor-Expert System) [2], whose main objective has been 
the generation of hypermedia-rich learning environments 
for teaching and learning basic principles in control theory 
and their application in natural, technical and social 
systems. 


All of DTEx-Sys functions are accessible through standard 
Web browsers. Advantages of such an approach include the 
following ones: 


° 


the Web is a generally accepted service for envisaged 
users of DTEx-Sys, 

Web browsers are free of charge software products, 
Web browsers enable client platform independence, 
Web browsers allow for an interactive hypermedia 
learning environment, 

the two-way student-teacher as well as student-student 
communication can be made extremely easy using 
Internet resources, and 

there are many other educational resources located on 
the Internet which can additionally be used by DTEx- 
Sys. 


User ‘DTExSys > 
“interface 


functions 


Figure 2. DTEx-Sys 3-tier architecture 


DTEx-Sys is developed as 3-tier client-server architecture 
[5] where the intelligent tutoring functions are separated 
from the user interface and the knowledge base as shown in 
Figure 2. Semantic networks with frames are the formalism 
for knowledge representation in DTEx-Sys. Nodes 
represent domain knowledge objects, while links show 
relations between objects. Additionally nodes from the 
knowledge base can have any of the following structural 
attributes: textual descriptions with hypertext, pictures, 
(sequence of) slides, animations (moving pictures and 
sound) and Internet links. 


DTEx-Sys 


Knowledge 
bases 


Student 


Teacher 


Figure 3. Structure of DTEx-Sys shell 


DTEx-Sys users are supported in the process of learning 
and teaching with the following services: knowledge 
base(s) access, testing knowledge, reception of test results 
with recommendation for future work and consultation with 
instructors. These services are of course provided by their 
respective system modules, as shown in Figure 3: (i) 
knowledge access module, (ii) testing module and (iii) 
consultation module. All the modules exhibit appropriate 
user interfaces. E.g. the user interface for accessing the 
knowledge bases shows domain knowledge objects along 


3 http://localhost/test/Main. htm - Microsoft Internet Explorer ; 


Nodes: 77 main menu administrator 


e ODBC Data Progranis 
} Sources (32bit) > 
7 @ Passwords B 
@ Phone Dialer & 
q @ Power 
] Management & 
© Printers 
© Programs > 
@ Recycle Bin & 
j @ Regional Settinas 
>. } 


{7 @ Resource Meter 
@ Rune 
@ ScanDisk > 
@ Scheduled Task. 
> 


|. @ Settings Me. 


Figure 4. Access to knowledge base 
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with structure attributes assigned to the selected object, see 
Figure 4. Testing is implemented using Web quizzes: after 
testing the student receives as feedback both the rating and 
recommendations for future work, see Figure 5. All of these 
activities are under control of the DTEx-Sys shell. 


1. Is Passwords program from Control Penel? * 
- Yes 


2. What Is Drive Converter (FAT32) for? v 

3. What Is the name of the menu where you can find CD-Player? v 

4. What Is Disk Defragmenter for? v 

5. How to Set Windows Options? v 

6. What Is the name of menu where you can find Dial-Up Networking? wv 
7. What Is the name of Folder where you can set Keyboard Settings? v 


8. What Is the name of Folder where you can configure drivers for your hardware? * 
3 - Control Pane! Control Pane! 


- Direct Cable Connection 
- Dial-Up Networking Dial-Up Networking 


9. If you want to connect two or more computers you should use... x 
Direct Cable Connection 


10. What Is the name of Folder where you can restore previously deleted files? v 


Points 8/12 


Rating: 4 


Figure 5. Results of test and recommendations for 
future work (rating 4 roughly equivalent to B) 


DTEx-Sys is implemented with Microsoft technology, 
therefore including the Windows DNA (Distributed 
interNet Applications Architecture) model. The contents of 
Web documents that are distributed to clients is generated 
dynamically. 


TV. CONCLUSION 


In this paper we discuss authoring shells presently being a 
very promising CAI tool class for insuring Web-based 
asynchronous distant education. After shortly reviewing six 
such systems known from the literature, we describe the 
basic architecture and outline the implementation of the 
authoring shell Distributed Tutor-Expert System (DTEx- 
Sys). DTEx-Sys has been in use at the Faculty of Natural 
Sciences, Mathematics and Education, University of Split, 
for some time already. E.g. it has been used for supporting 
both the course "Computer Aided Instruction" during the 
past academic year for several study groups, and the 
realization of a number of diploma works for various 
domain knowledges. Beside that, in our teaching practice 
we use DTEx-Sys for improving the learning and teaching 
process in regular and further education, where it serves as 
a secondary knowledge source for teachers and their 
students in secondary and primary schools. 
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Abstract -- In this communication, we present the new 
concept for the Smart Sensors, their characteristics, the 
technical and economic advantages in the production 
field. We develop an application example, used the smart 
sensors in the control industry of a production site " 
assembly of metal tubes" , where we investigate the all 
system requirements of the monitoring in line control. 
This new generation of sensors is going to take a large 
and larger place in the sensor market in the future. They 
increase the products quality, ensure a geater safety 
margin and an availability of the production equipments. 
Keywords : Design; Smart Sensors; Production 
Automation System; Process monitoring. 


I. INTRODUCTION 


Several sectors of industrial applications (domotics, 
automotive, Production automation process, robotics, 
aeronautics...) need sensors with a great quantity 
produced at the lowest cost as possible, better 
adapted to the market constraints (security, safety, 
reliability, economy) [1]. In addition the 
dependability becomes a major factor to ensure an 
optimal competivity of the production equipment.In 
this way, many manufacturers encouraged researchs 
in the design of smart sensors provided with local 
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computation capabilities ( microprocessor, microcontroller, 
DSP), which can be interconnected on a bus system (field 
bus) and the all electonics necessary for the signal 
conditioning are included. 


II. GENERIC ARCHITECTURE [2] 


The smart sensor incorporates an internal arithmetic unit 
(microprocessor, microcontroller), a memory support, a 
conditioning signal system with one or more transducers 
and supporting electronics on it, and communication 
interface. The latter constitues a fundamental element for 
the smart sensor design. 


A. Principal components 


- Conditioning stage : standardize the transducers signals 

- Digitalization stage : multiplexing, sampling, numerical 
conversion 

- Processing stage : computation, storage, display of the 
results, activation of alarm... 

- Communication inteface : bidirectional information 
exchange. (see figure 1.) 

For this smart sensor, an architecture for the central 

processing was proposed by Beaudouin [5] accross the 
figure 2. Mesurande ( primary measure) 


Transducer + conditioner i 


Power supply 
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‘ Communication 


Communication network 


Fig 2. Central processing architecture 


The breakdown detection, failure diagnosis and 
emergency reaction can be exploited with the help of 
small basic algorithms stored in memory. 

The designed device has the necessary means to 
indicate the defective links and informed the 
supervisor station [6], this decrease times of defect 
localization and diagnosis, an early detection of the 
drifts or incidents is planned "preventive 
maintenance" [7 ]. 


B. Advantages of the Local Intelligence 


1. Acquired immunity and robustness 
The signal digitized or processed in a proximity of 
the sensor : elimination of parasites or degradation 
during the processing. The performances benefited 
are : better resolution, higher accuracy and good 
stability, especially under severe environmental 
conditions. 
2. Computing power 
Specialized processing tooi (ex D.S.P) is associated 
with each site of measurement or control: 
- Processing decentralization 
- Release the acquisition station for other nobler 
tasks. 
- Measurement Credibility (reliability, accuracy). 
- Real time transfer. 
3. Lowest cost of electronics conditioning induced by 
the micrelectronic technology. 
4. Weak and decreasing cost of digital processing 
signal. 
5. Easy wiring " topology Bus": 
- significant flow 
- sufficient reliability 
- implementation simplicity. 
6. Reduce maintenance cost. 
7. Identification 
The sensor is identified by a code or an address in 
order to dialogue with the control station: 
- Sending of information (measurement, state, 
internal control...). 


- Reception of command or data in numerical form. 
III. CONSTRAINTS 


A. Environmental constraints 


Electronics associated must be able to support the 
environment parameters (temperature, pressure, 
moisture...), electromagnetic noise, explosive atmosphere... 


B. Power Supply 

Convey the Power Supply ( solar cell, battery,...), when a 
great number of sensor must be to install, in order to 
preserve the cabling reduction. 


C. Implementation difficulties 

Develop An acquisition chain with intelligent sensors, is 
a long operation, requires a heavy and _ specific 
infrastructure , also the sensors site must be optimized. 


D. Lack of communication standard 

It doesn't really exist a standartd of communication, the 
protocols differ from one manufacturer to another, each 
try to impose its own standard, 
this limits a wide penetration of this sensors category in 
industrial fabric. 


IV . NEWS SERVICES OFFRED 

The smart sensor is a tool wich must be configurable by 
numerical communication in order to be adapted for the 
application used. The mains services looked for: 

- Measurement and productivity improvement 
- Reduce production costs 
- Reduce maintenance costs. 

Maintenance consists in repairing the weakening element 
in order to ensure the equipment aviability. The history 
storage schows the true policy of preventive maintenance 
[8]. 

V. APPLICATION 
A. Production site monitoring 
" Assemby of metal tubes" 
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The currently industry is faced with the need to 
boost reliability and reduce maintenance costs, it 
possible to perform early inspection and repair in 
complicated systems, to minimize the extent of 
damage or destruction of the total system, this 
detected by quick surveillance with smart sensors. 

In addition security sensor ( fire detection, smoke, 
pollutant gas,...), a visual inspection using camera 
it's necessary in order to detect any imperfection of 
process. The use a telemetric sensor with ultrason 
determine the object proximity or its position space. 
An image processing and acoustic signals are 
necessary to estimate the environment of the mobile 
object or to detect the anomalies. 

For the ultrason sensor [9], the measurement 
distance is corrected ambient temperature, the sound 
velocity is related to temperature T by the following 
relation : 

Clas). = Cy( 1+ 1/273)" «Co 1 =0 ) = 331.4 
m/s. 

When the frequency increases, the attenuation is 
strong and the distance reached is weak. 

The metal, wood, glass, rubber reflect perfectly the 
Ultrasounds, but the clothing, cotton, wool absorb it. 


Carry - camera 


Ultrasonic boxes 


Network communication 


nr 


Velocity m/s 


16 20 AL 0 20 4o 60 T(°C) 
Fig 3. Ultrasounds velocity as a function of temperature 


¢ Ultrasound in nondestructive control 


The Ultra-sonic inspection of the manufacturing defects 
based on the echos analysis observed after an ultrasonore 
wave emission by a piezoelectric transducer. 

These wave is propagated in controlled material and 
undergoes reflexion and diffraction phenomena. In 
presence of a surface defect ( crack, internal defect,...), 
one part of incidental signal is reflected and detected by 
the sensor: 

- The delay analysis between the emitted signal and the 
echo, determines a defect localization 

(t,=2n0/f ; f=40khz). 

- The form analysis of the echo allows to dtermine of the 
defect geometry. 
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Reflect - screem 


The development of an autonomous software m onder 
t ensure the manasement compurer-assisted of this 
park with smart semsozs, appears very miteresime. 
where each semsor sovem system part under 
surveillance, this same sensor can be used for other 
tasks by a specific reprogrammmine. 
This system contamme different nafure of sensozs, 
mast present an casy 


~ 


. Netmork Bitbus topology 


B. Conclusion 

We evoked im this article, the smart sensor concept 
benefitting from the new techniques of integration. 
This device have the potential and high performance 
for use m 2 variety of industry fields specilly in 
spplication for control or metrology. The architecture 
presented is well adapted for production site, an 
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The Role of User Models in Adaptive 
Hypermedia Systems 
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Abstract -— Computers have been used in education already 
from the beginning. Educational systems have developed from 
first computer assisted learning systems to more complex 
intelligent tutoring systems and hypermedia systems. 
Intelligent tutoring systems are knowledge centred and are 
able to adapt the instructional sequence to the particular user. 
On the other hand, hypermedia systems are based on 
hypermedia and the user can freely browse the learning 
material. 

Lately, the last two technologies have merged into so called 
adaptive hypermedia systems. They are actually hypermedia 
systems, which incorporate some _ intelligent tutoring 
techniques. Therefore, they offer the freedom of exploratory 
learning, yet dynamically adapt to the individual user’s 
knowledge level and learning goals, provide intelligent 
guidance, and support the user in acquiring knowledge. 

In intelligent tutoring systems, the adaptation is based on user 
modelling. Since adaptive hypermedia systems take the 
adaptation principles from intelligent tutoring systems, they 
also use user models for adapting the system to a particular 
user. Hence, the user model is one of the most important 
components for adaptation. Although the adaptation itself can 
be realised through different techniques, it is always based on 
the user model. 

In the paper, adaptive hypermedia systems and their general 
structure are described first. Then the role of a user model is 
presented together with the various approaches to user 
modelling in adaptive hypermedia systems. In the end, we 
investigate user modelling in different educational adaptive 
hypermedia systems. 


Index terms — hypermedia systems, user model, adaptation. 


I. INTRODUCTION 


Within years, two main approaches to computer assisted 
learning have developed: educational hypermedia and 
intelligent tutoring. 


Educational hypermedia systems are computer based 
learning systems, where the teaching material is presented in 
hypermedia form. Therefore, they support free exploration 
of the material, where the user has full control over the 
learning process (user—driven learning process) [5]. Pure 
hypermedia systems lack good navigational tools (getting 
lost problem) and users often encounter problems in finding 
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relevant information and getting an overview of the 
material. 


Intelligent tutoring systems are tutoring systems that support 
the user in acquiring knowledge by dynamic adaptation of 
the system to the individual user [18]. Thus, the user has 
only little (or no at all) control over the learning process 
(system—driven learning process), but the system directly 
guides the user and adapts to their knowledge level. The 
biggest drawback of intelligent tutoring systems is the 
inflexibility of direct guidance, where the user is unable to 
directly influent the learning process. 


In recent years, a new approach, adaptive hypermedia 
[5,12], is becoming quite popular and widely used. It is 
based on traditional hypermedia presentation, which is 
upgraded with intelligent tutoring components. Such 
integrated system inherits the freedom of hypermedia and 
adaptiveness and guidance of intelligent tutoring, thus 
advantages from both. 


The focus of this paper is on different approaches to user 
modelling in adaptive hypermedia systems. The next section 
presents adaptive hypermedia systems together with their 
general architecture. Section three is focused on the user 
model, where different approaches to user modelling are 
investigated. Techniques used in some concrete educational 
adaptive systems follow in the last section. 


II. ADAPTIVE HYPERMEDIA SYSTEMS 


Typically, an adaptive hypermedia system is a system, 
which [8]: 


e is based on hypermedia, 

e includes a domain model composed of a set of 
elementary pieces of knowledge and their relationships 
in information space, 

e maintains an explicit user model that records individual 
user properties, and 

e is able to adapt some visual or functional parts of the 
system according to the user model. 


An adaptive hypermedia system resembles the structure of 
an intelligent tutoring system although it also introduces 
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some new hypermedia related components. It consists of 
three main components (see Figure 1): 


1. hypermedia component (covers domain hyperspace 
organisation, representation and navigation). 

2. intelligent tutoring component (covers all the intelligent 
actions of the system), and 

3. supervisory component (adjusts the operation of both 
components). 


ee ee ee ee 


Figure | 
Structure of an adaptive hypermedia system 


To be able to adapt itself to an individual user, the system 
has to be aware of the teaching domai., the individual users 
and their knowledge and has to monitor their learning 
progress. Hence, the domain knowledge and the user. the 
most essential parts of any adaptive system, have to be 
represented in the system. 


According to the definition of adaptive hypermedia systems, 
intelligent tutoring component includes three modules, 
inherited from intelligent tutoring systems [18]: 


e domain model (represents knowledge about the 
teaching domain and explains what is being taught), 

e user model (describes user's knowledge, saves 
information specific to each individual user and 
therefore explains who is being taught). and 

e pedagogical module (takes care of all didactical 
decisions of the system and supervises the learning. 
thus explains how to teach). 


If we look at the traditional teaching process as a 
communication of knowledge, it also involves three 
different types of knowledge: of the teaching domain, of the 
teaching strategies and of the learner [9,18]. Latter is crucial 
for the individualisation of the teaching process. 
Experienced teachers usually have well developed models 
of individual students, as well as of the whole class. These 
mental models, records of achievements or student profiles 
are constantly updated as the teacher interacts with the 
students. The teacher uses them to select appropriate 
domain knowledge and teaching strategies. Similarly, an 


adaptive system uses user model to select suitable 
presentation of information and constantly updates the 
model during the learning process. 


Til. USER MODEL 


A perfect user model would include all features of the user's 
behaviour and knowledge that effect their leammg and 
performance [18]. The information. contained im the user 
model, determime the adaptability of the system 
Constructing such a model is quite complicated task, even 
for people not only for machines, thus simplified, partial 
models are used in practice. 


Regarding the user model, three aspects have to be 
considered: what information about the user is included m 
the model and how it is obtained, representation of this 
information in the system, and the process of forming and 
updating the model. 


A. Information 


Information about the user, which is saved m the user 
model, can be regarded as domain-independent and 
domain—dependent; or divided ito knowledge component 
(user's knowledge) and imterfaces component (users 
preferences); or as static (constant through the leammg 
process) and dynamic (changes during the leaming process). 


Static Properties — Static part of the user mode] [11.12] 
covers user's personal characteristics (age, gender. grade_.). 
user's capabilities (background knowledge. proficiencies, 
cognitive and non-cognitive abilities..}). users 
preferences etc. 

These properties are usually collected at the begining of 
the learning process, using questionnaires (users supply data 
on their selves) and different tests. 


Dynamic Properties — Information about the user 
interaction with the system makes up the dynamic part of 
the user model [11,12]. This covers user's knowledge. 
concepts and skills (mcluding misconceptions), learnmg 
Style, motivation, viewpoints, current goals, plans and 
believes, learning activities that have been camied out. 
objectives that have been achieved... 

Inputs for this part of the user model are gathered directly 
from user (specification of current goal), through tests and 
practice (test results, user history of responses and problem 
solving behaviour), or user's actions (browsing behaviour, 
number of nodes visited, visited concepts, time spent on 
page, total session time, selection of links, searching for 
further info, queries to the help system). This information is 
constantly being collected during the learning process and is 
also used for updating the user model. 
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B. Representation 


Different methods can be used for constructing the user 
model. They include [12]: 


e Bayesian methods, 

e Machine learning methods (rule learning, learning of 

probabilities, instance—based learning), 

Logic—based methods (first order predicate calculus), 

Overlay methods, 

Stereotype methods, 

Other general methods (plan recognition), 

Specifically developed computational procedures 

(user’s expertise is calculated from their navigational 

actions or time spent on documents), and 

e Specifically developed qualitative rules and procedures 
(special rules regarding user’s properties or behaviour). 


In practice, a combination of two or more methods is 
frequently applied, especially when different methods are 
used for initialising and maintaining the user model. This 
assures more accurate modelling and allows better 
exploitation of gathered information. 


C. Forming and Updating 


The methods for exploiting the user model are similar to the 
methods for constructing it. Here also, well-known methods 
from artificial intelligence and machine learning can be 
used (Bayesian networks, rule learning, instance-based 
learning, learning of probabilities, logic-based, decision— 
theoretic, heuristic...), as well as other general techniques 
and principles (plan recognition), specifically developed 
computational procedures (procedure for computing the 
strength of a link between two documents regarding user’s 
current goals and interests), or specifically developed 
qualitative rules and procedures (rules for selecting and 
evaluating examples, rules for choosing adaptation type, 
rules for choosing questions) [11,12]. 


Techniques for user model representation and inference 
used in some educational adaptive hypermedia systems are 
presented in the next section. 


IV. USER MODELLING IN PRACTICE 


Approaches to user modelling in eleven different 
educational systems are briefly presented together with their 
approaches to user modelling [1]. First, some system that 
use an overlay method are described, because this is quite 
simple and widely used technique. Next, stereotype method 
is mentioned, a technique that can be found in many 
systems, especially in combination with other more 
sophisticated ones. In the end, we take a look at the systems 
that use a combination of different techniques. 


Overlay models — User’s knowledge is regarded as a subset 
of expert’s knowledge and therefore user’s knowledge or 
their approach to problem solving is described through the 
knowledge or approach of the expert. Because the 
knowledge of the user is a subset of domain knowledge (an 
overlay over the domain knowledge), the same 
representation can be used for domain model and user 
model. If the user model is a real subset of the domain 
model and the incorrect knowledge is not included in the 
model, we talk about strict overlay model. In extended 
overlay model, incorrect or faulty knowledge and 
misconceptions are also part of the model. 


Among the systems that use overlay model for user 
modelling are the following. 


AHA (Adaptive Hypermedia Architecture) system (De Bra 
[6]) is a general adaptive hypermedia system for 
hypermedia courses delivery over the Web. The user model 
is based on the knowledge of concepts that is acquired by 
reading hypermedia pages and by solving tests. 


In AHM (Pilar da Silva [15]), a prototype system for 
developing adaptive hypermedia courseware, adaptation 
depends on user’s level of expertise about known concepts, 
which is a subset of all domain concepts. 


Anes (Kavcic [13]) is a shell of adaptive hypermedia system 
on the Web. The user is modelled using strict overlay model 
over the network of domain knowledge concepts. 


ISTS—Tutor (Brusilovsky [3]) is intended for learning the 
print formatting language of an information retrieval system 
CDS/ISIS. It uses the overlay user model with a set of 
integer counters. 


RATH (Relational Adaptive Tutoring Hypertext), an 
adaptive tutoring WWW system prototype (Hockemeyer 
and Albert [10]), is based on knowledge space theory, 
where the user’s knowledge is described by their knowledge 
state, which is a set of items (or test problems) that the user 
is able to solve. Each knowledge state is an overlay over the 
domain knowledge items. 


Stereotype models — Individual user is assigned to one or 
more stereotypes (such as novice, intermediate, advanced, 
or expert). Each stereotype has its predefined properties and 
the user that belongs to that stereotype also inherits its 
properties. 


Many systems use stereotypes for describing the user, but 
they usually combine them with other methods. Here, only 
one pure stereotype system is described. 


HyperTutor (Pérez, Gutiérrez and Lopistéguy [14]) is an 
adaptive hypermedia system, which employs exercises for 
obtaining information about the user. It uses stereotypes for 
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user modelling and the user can belong to one of three 
groups: novice, medium, or expert. 


Combination of various methods — Sometimes only one 
method does not satisfy all modelling needs and a 
combination of two or more methods has to be chosen. 


Anatom-Tutor (Beaumont [2]) is an intelligent tutoring 
system for teaching anatomy. It contains a rule-based user 
modelling component, which operates with weighted rules 
and stereotypes. 


DCG (Dynamic Courseware Generator) system (Vassileva 
[17]) is a tool for authoring adaptive computer assisted 
learning courses. For each individual user, the system 
creates a Bayesian user model, which is an overlay over the 
concept structure. Bayesian networks employ probabilistic 
reasoning about user’s knowledge state based on user’s 
interaction with the system. 


ELM-ART (Episodic Learner Model — Adaptive Remote 
Tutor) system (Brusilovsky, Weber and Schwarz [4]) is an 
intelligent tutoring system based on World Wide Web for 
supporting learning of programming in Lisp. It uses two 
user modelling techniques: simple overlay models and more 
complex episodic learner models. In episodic learner 
models, the user’s knowledge is represented by a series of 
episodes that denote user’s individual learning history, their 
behaviour and former problem solving situations. 


InterBook (Brusilovsky, Schwarz and Weber [7]), an 
authoring system for designing Web-based adaptive 
hypermedia systems, uses a concept based overlay model, 
although the user model is initialised using a stereotype 
model. 


PUSH (Plan and User Sensitive Help) Project (H66k et al. 
[16]) is a prototype of an adaptive help assistant for a 
specific domain and application, and investigates ways of 
utilising adaptive hypermedia to aid users of an on-line 
manual. For user modelling it uses both plan inference 
(deducing user’s goals from their queries) and stereotypical 
user modelling. 


V. CONCLUSION 


Adaptive hypermedia helps user finding or accessing 
relevant information, tailors information presentation to the 
user, or adapts the interface to the user, and this way helps 


the user to use the system and supports them in learning 
process. 


Adaptive hypermedia combines traditional hypermedia with 
intelligent tutoring and therefore benefits from free 
exploration as well as from its adaptation abilities and direct 
guidance. All adaptation is done via the user model that 
saves the data about each individual user of the system. 


a 


~ 


Thus, the user model has to reflect as many important 
features of the user as possible. Unfortunately, it is very 
difficult to obtain decision—relevant information about the 
user. Hence, user models are simplified in practice and 
consider only some user’s features. 
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Functionality Specification for Adaptive User 
Interfaces 


Andrina Grani¢ and Vlado Glavini¢, Member, IEEE 


Abstract — The goals of human-computer interaction are to 
provide functional, usable, safe and well designed computer 
system interfaces, thus achieving their almost transparency 
and enabling end users to fully concentrate on the work. 
Particularly in the last years it has been recognized the need 
for the development of adaptive user interfaces in several 
application areas. Our research is concentrated on adaptive 
user interfaces for intelligent tutoring systems, which allow 
human teacher emulation in the process of learning and 
teaching. In the paper we consider the functionality 
specification of an intelligent tutoring system user interface 
expressed by means of User Action Notation, and discuss the 
way it has to change the provided functionality and its 
presentation on the basis of users’ individual characteristics, 
in order to accommodate differing user’s requirements. 


Index Terms — adaptability, intelligent tutoring systems, User 
Action Notation, user interfaces 


I. INTRODUCTION 


The goals of human-computer interaction are to provide 
functional, usable, safe and eventually well designed 
computer system interfaces, thus achieving their almost 
transparency and enabling end users to fully concentrate on 
the work. This is the main reason why in designing human- 
computer communication it becomes increasingly important 
to specify the "look and feel" of a proposed interface, i.e. its 
design and essential behaviour. This in turn supports a 
system’s functionality specification within the respective 
user conceptual model. 


Adaptive interfaces extend the current state of the art in 
interface design of computer systems and are a promising 
attempt to overcome problems due to both the increasing 
complexity and sophistication of human-computer 
interaction as well as individual end user needs. Our 
research is concentrated on usable, human-centered, 
adaptive user interfaces for intelligent tutoring systems. 
Intelligent tutoring systems (ITS) [1] are a class of 
computer systems allowing human teacher emulation in the 
process of learning and teaching. Understanding diverse 
kinds of end users, their individual needs and tasks that 
must be accomplished is essential, because from the user's 
standpoint most of an intelligent tutoring system can be 
considered as a user interface. 


Andrina Grani¢ is with the Faculty of Natural Sciences, Mathematics and 
Education, University of Split, HR-21000 Split, Croatia (e-mail: 
granic@pmfst.hr). 

Vlado Glavinié is with the Faculty of Electrical Engineering and 
Computing, University of Zagreb, HR-10000 Zagreb, Croatia (e-mail: 
vlado.glavinic@fer.hr). 


The paper discusses the role of user interfaces for ITSs and 
the possibilities of determining the respective functionality 
at the user-machine interaction level along with aspects of 
its adaptability. The User Action Notation [2] is advocated 
as a suitable candidate for ITSs user interface specification. 


II. USER INTERFACES 


As computer systems have developed over the last years, 
user interface complexity, sophistication and use of the 
visual expressions have also evolved, thus influencing the 
development of more appropriate interaction between a 
human user and the computer system. With the increasing 
desire for high-quality user-centered computer systems, 
more care is given to visual forms of human-computer 
interaction. 


The principal function of the user interface is to facilitate 
human-computer communication by enveloping in a 
dialogue both the hardware and software of a computer 
system, and particularly the semantic of applications. This 
results in hiding the structure of input/output devices, 
operating systems, networks and applications, and lets the 
user switch applications rapidly [3]. 


The design and implementation of human-computer 
interfaces is very hard and time consuming, because of its 
complexity and difficulty for implementation, debugging, as 
well as modification [4, 5]; studies show that the user 
interface portion of an application comprises about 50% of 
the code and development time [ibid]. Additional 
difficulties arise in the case of user interfaces that are to 
tailor a system's interactive behaviour according to 
requirements of an interaction, thus creating the need for 
adaptive human-computer interaction. 


A. Adaptive User Interfaces 


The goal of adaptation is to present the human user with an 
interface that is easy, efficient and effective to use. To reach 
these goals it is necessary to provide a user with an 
interface suitable for changing either (i) the way it 
communicates with her/him or (ii) the system functionality 
provided on the basis of the individual characteristics of the 
user herself/himself [6, 7]. 


It has been argued that adaptive user interfaces provide the 
only way to achieve certain usability goals, such as meeting 
the changing user requirements. Within this context a user 
interface is called intelligent in the degree it adapts itself to 
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user needs. Intelligent user interfaces facilitate a more 
“natural” user-computer interaction, attempting to imitate 
human-human communication [8] and constitute a major 
direction in current human-computer interaction research. 
Several efforts towards the development of adaptive user 
interfaces have been reported in the literature since the early 
eighties when they appeared, resulting in a number of 
prototyping systems in several application domains like 
intelligent help, intelligent tutoring or information filtering 
[9, 10]. 


Ill. USER INTERFACE SPECIFICATION 


No matter how usable an interface design is, without a 
complete, thorough and understandable specification, 
misunderstandings can arise and unusable or incomplete 
user interfaces will result. Also, an appropriately specified 
functionality of a system can help to discover possible 
inadequacies. Since we consider eminently interactive 
programs as ITSs, whose functionality deeply influences the 
user conceptual model and in turn the system’s look and 
feel, in the following the distinction between user interface 
and system functionality will be somewhat blurred. 


Considering existing user interface specification methods it 
should be noted that at the present moment there is a 
number of them already developed which are in a greater or 
lesser degree suitable for the purposes of adaptive user 
interface definition. 


Grammar or diagram approaches are suited for some of the 
interaction styles like menus, commands or form fill-ins; 
however, they are clumsy for direct manipulation interface 
definition, because they cannot conveniently cope with the 
variety of permissible actions and visual feedback the 
system provides [11]. In addition, direct manipulation 
interfaces depend heavily on the context to determine the 
meaning of the input. Thus alternative methods are needed 
which could handle the rich world of direct manipulation 
interfaces (pointing, dragging, clicking). The User Action 
Notation (UAN) is a representative of such methods being a 
high-level notation to facilitate capturing the interaction 
design [12, 2, 5]. According to one of its authors — Deborah 
Hix — UAN “‘is a notation to write down the actions that the 
user will perform with the system and system responses, 
from the user’s point of view” [13]. 


IV. INTELLIGENT TUTORING SYSTEMS 


Intelligent tutoring systems (ITS) are a new generation of 
computer assisted instruction systems (CAI) intended to 
support the process of learning and teaching in an arbitrary 
domain knowledge by taking into consideration both 
teachers’ and students’ individuality [1, 14]. Depending on 
students’ learning capabilities, ITSs can adjust the contents 
and the way of domain knowledge perception. ITSs are 
built on a fairly well established architecture, which relies 
on four interconnected modules, see Figure 1: (i) expert 
module acting as the domain knowledge unit, (ii) student 
module comprehending the generated student model based 


Figure 1. General ITS Architecture 


on the learning and teaching process in the domain 
knowledge, (iii) teacher module guiding the learning and 
teaching process and (iv) communication module realizing 
the interaction among student, teacher and knowledge. 


As the need to cover a variety of different domains have 
arisen since, instead of having a number of specialized ITSs 
for the domains of interest, “ITS generators” were 
developed, which are usually denoted as authoring shells 
[15]. These shells can be “programmed” for a particular 
domain by modifying the domain knowledge thus resulting 
in a specific ITS. 


In order to study the possibilities such a tool could provide 
in instruction/education, an intelligent authoring shell 
named TEx-Sys (Tutor-Expert System) has been developed 
[16]. TEx-Sys is intended to adjust to teachers as well as to 
students within a hypermedia interactive learning 
environment by supporting teachers in the development of a 
series of intelligent tutoring systems for arbitrary domain 
knowledge and, conversely, by enabling students to learn, 
test themselves, consult the system for the obtained score, 
as well as receive from it advice for further work. 


V. SPECIFICATION OF AN INTELLIGENT TUTORING SYSTEM 
USER INTERFACE 


As already stated above most of an ITS can from the user’s 
standpoint be considered as a user interface. Its appropriate 
specification provides for the system’s refinement and 
especially for improving its adaptability to different users 
and various user classes, respectively. Because of its 
interface-specific symbols for actions and concurrency, and 
interrupts and feedback, we consider UAN appropriate 
enough for specifying ITSs’ behaviour, as well as for 
describing user actions. 


The above reasoning can be supported by considering the 
case of TEx-Sys whose user. interface belongs to the 
category of graphical ones, usually referred to as WIMP 
(Windows, Icons, Menus and Pointing devices) [11, 13]. 
Such interfaces support information presentation through 
graphical objects like windows, icons and menus, while 
interaction is performed using a combination of direct 
manipulation and menu selection, but can also be enhanced 
with function keys and form fill-ins. TEx-Sys menu portion 
possesses a relatively flat structure with eight first-level 
selection alternatives and only a two-level depth. As shown 
in Figure 2, the main first-level menu options include: file, 
which enables work with the knowledge base; node, for 
embedding objects and their structure attributes; /ink, for 
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embedding new users' links and for 
connecting/disconnecting objects in the knowledge base; 
hypermedia, for hypertext description and presentation of 
the structure attributes of objects in the knowledge base. 


Figure 2. TEx-Sys Graphical User Interface 


The specification of the operation adding a new node in the 
formalization of TEx-Sys domain knowledge illustrates 
UAN usage. The operation can be described in prose as 
follows: 


1. move the cursor to the Node alternative in the first-level 
selection of the main menu; depress and release the left 
mouse button; 

2. depressing and releasing the mouse button drops down 
the second-level selections for Node alternative, see 
Figure 3; 


Figure 3. Second-Level Selection Add 
for Node alternative 


3. move the cursor to the Add alternative in the second- 
level selection; depress and release the left mouse 
button, see Figure 3; 

4. depressing and releasing the mouse button enables the 
creation of an Add Node form fill-in, see Figure 4; 


|. Add Node 


Figure 4. Add Node Form Fill-In 


5. move the cursor to the Node Name window of the Add 
Node form fill-in; depress and release the left mouse 
button; write the name of the new node in the domain 
knowledge, see Figure 4; 

6. move the cursor to the Close button of the Add Node 
form fill-in; depress and release the left mouse button; 
depressing and releasing the mouse button closes the 
Add Node form fill-in. 


It can be noted that the above operation description consists 
of user actions along with interface feedback information, 
which is the interface response to user actions. Since user 
actions are interleaved with interface feedback there is no 
line-by-line association of feedback with the corresponding 
user actions [2]. The user action portion along with the 
interface feedback as described with an UAN specification 


Table 1. UAN Specification of User Action Add Node 
and Appropriate Interface Feedback 


TASK: add a node to the domain knowledge 


USER ACTIONS INTERFACE FEEDBACK 
~[Node_selection] Mva display(Node_Menu) 
~[Add_selection] Mva display(Add_Node form fill-in) 


~[Node_Name] Mv wieoet wih) Seek ere Nas | 
~[Node_Name_window in 
Add_Node_form_fill-in] Mva 


keyboard"name_of_node" 


~[Close_button in 
Add_Node_form_fill-in] Mva 


close(Add_Node form fill-in) 


offers a more precise correspondence between feedback and 
separate user actions in the sequence, as shown in Table 1. 


Table 2. UAN Specification of User Action Delete Node 
and Appropriate Interface Feedback 


TASK: delete a node from domain knowledge 


Because of space limitations only another one UAN 
specification is given in Table 2, so as to provide the 
specific flavour. This is the specification of the operation 
delete a node from the domain knowledge, whose effects 
(i.e. interface feedback) are shown in Figure 5 and 6. 


Delete Del =! ay 


Figure 5. Second-Level Selection Delete 
for Node Alternative 


Figure 6. Delete Node Dialogue Box 


VI. ADAPTIVE USER INTERFACES FOR INTELLIGENT 
TUTORING SYSTEMS 


Present day interactive applications make it an ever- 
increasing requirement to develop user interfaces that 
exhibit adaptability. This is even more important for ITSs, 
which are not only to support a more efficient and effective 
user interaction, but also to change the provided 
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functionality on the basis of users' individual characteristics 
in order to address their needs and requirements. This 
foresees an understanding of diverse kinds of ITS users 
(student or teacher, novice or expert, to mention only some 
most prominent extremes), as well as their usually 
conflicting demands and individual needs. 


It should be noted that ITSs themselves represent adaptive 
systems since they incorporate sufficient intelligence to 
react properly to users’ inputs. However, it seems that up to 
now not enough attention has been given to the adaptability 
of the respective user interface. This adaptation should 
comprehend the following actions: (i) adaptation of the 
user-ITS communication, with the user performing the same 
operations but the interaction style adapting to user habits; 
and (ii) adaptation of ITS functionality, providing the users 
newly shaped operations to improve (what usually reduces 
to speeding up) their interaction with an ITS as they gain 
knowledge of the system and develop a_ personal 
communication style. 


Figure 7. Adaptive ITS Architecture 


Hence the ITS user interface should be designed so as to 
possess a degree of intelligence which would parallel that of 
the ITS proper and effectively provide a proactive like 
behaviour. This means that the user interface should 
monitor and store users’ behaviour thus requiring a suitable 
history log. On the other hand an appropriate “trigger” 
should provide the user inierface modification subject to a 
particular “threshold level”, which determines when to 
perform the necessary changes, while an “inference engine” 
should provide the information (what) on the changes to be 
performed. This results in an ITS architecture as shown in 
Figure 7. 


VII. CONCLUSION 


Adaptive user interfaces are a promising attempt to 
overcome contemporary problems in human-computer 
interaction due to ever-increasing user requirements and 
expectations. Particularly in the last few years, the need for 
such an approach has been recognized in diverse application 
areas in order to automatically adjust to a palette of users. 
This is especially true in education where particular CAI 
systems have been developed with the aim to broaden the 
scope of the teaching and learning process. Considering 
intelligent tutoring systems, of which TEx-Sys is a 
representative, it can be noted that their efficiency, 


effectiveness as well as usability would certainly gain from 
the inclusion of adaptability in the respective user interface. 
In the paper we elaborate on user interface issues for 
intelligent tutoring systems and highlight a possible ITS 
architecture that is to support adaptability basing on built-in 
intelligence. 
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The development of an ISO 9000 Quality 
Management System to meet the requirements 
of the European Foundation for Quality 
Management (EFQM) Model - The Cyprus 
Telecommunications Authority's Case. 


Michalis Achilleos', George Ioannou” 


Abstract— 


The Cyprus Telecommunications Authority's (CYTA) 
modernisation programme focuses on the use of Business 
Process Reengineering tools in order to improve the 
organisation's customer focus and people's focus orientation 
with parallel improvements in productivity, the ability to 
develop and offer products and services with market 
improvements in operational flexibility. The Authority also 
in its search to find a modern management model able to 
support this modernisation programme and its path towards 
excellence, has opted for the European Foundation for 
Quality Management excellence model’. 


In a parallel but not separated move, the Authority has 
decided to embark on the development and implementation 
of the ISO 9000 quality systems standard in its customer 
service operations. The first such system to be certified in 
June 1999, was the Authority's Limassol Customer Order 
Fulfillment Department and the second one being the whole 
of the Customer Services in the area of Larnaca. 


The purpose of this paper is firstly to define how an ISO 
9000 certified business unit can be measured against the 
requirements of the EFQM Model and secondly outline a 


‘Quality Assurance Manager for Customer Order 

Fulfillment Department of the Cyprus Telecommunications 
Authority - Limassol Area and currently the Head of Lamaca 
Customer Services. 


*Senior Manager, Management Consulting Services department, 
PricewaterhouseCoopers 


plan of action that could be used to guide this business unit 
towards the EFQM Model of excellence. 


Applied science and the development on new technology 
amongst the other aspects is also geared towards the design, 
development, production and service provision for the 
ultimate “customers” who are the users of this new 
technology. This whole activity cannot happen in a 
vacuum. It has to be effectively managed so as to take into 
account the people, the resources, the plant processes, the 
leaders and the ultimate "customers". 


The Model 


The European Foundation for Quality Management Model 
(EFQM) is a way of managing activities based on the Total 
Quality Management concepts that can lead an organisation 
to sustainable business excellence. The main Total Quality 
Management concepts engaged include customer focus, 
supplier partnerships, people development and involvement, 
systematic process management, continuous improvement, 
leadership, public responsibility and results orientation. 


A graphical representation of the model is shown in figure]. 
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Fig 1: The EFQM excellence model. 


The nine elements or criteria are basically the TQM 
concepts noted above expressed in the following logical 
sequence: 
Leadership, People Management, Policy and 
Strategy, Resources and processes can achieve 
Customer Satisfaction, employee satisfaction, 
positive impact on society and can ultimately lead 
to business excellence. 


The first five criteria are the Enablers that are concerned 
with how the organisation approaches each of the criterion 
parts. The last four criteria are the results and are 
concerned what the organisation has achieved and is 
achieving. 


The nine criteria do not operate in vacuum. There are links 
between each criterion. For some criteria, the link is more 
obvious and direct, e.g. people management and employee 
satisfaction, or process management and business results. 


Self- assessment is a comprehensive, systematic and regular 
review of an organisation's activities and results against the 
nine criteria (or model elements) and their sub criteria. It 
can be initiated in the organisation as a whole or a sub- 
division of the organisation. 


EFQM and the Cyprus Telecommunication Authority 
(CYTA) 


CYTA has opted for the EFQM model because in the words 
of the current General Manager "... the basic principles of 
the EFQM model and its objectives coincide with CYTA's 
modernisation programme called Prometheus. This 
modernisation programme is used by CYTA to transform 
itself to a modern dynamic organisation that could be able 
to survive in a competitive market place." 


Processes 


People Results 


— 
Perform. 
ema Results 
Society Results 


CYTA has already created a central EFQM co-ordinating 
function and at least one subdivision the Larnaca Area 
Customer Services is currently going through the process of 
self assessment. ditionally, Larnaca Area Customer 
Services is using its newly implemented ISO 9002: 1994 
Quality Management System. 


The first department to certify its quality management 
system against the ISO 9002: 1994 standard was the 
Customer Order Fulfillment department in the Limassol 
area. 


The EFQM Model and the current version of Quality 
Systems Standard 


The current version of ISO 9000 series of Quality Systems 
Standards specify the requirements aimed at achieving 
primarily customer satisfaction by preventing 
nonconformity at all process stages from design though to 
servicing’. They are what can be called compliance 
standards aiming at demonstrating confidence in product or 
service conformance. Issues such as leadership people 
management, resources, people satisfaction, customer 
satisfaction, impact or society and business results are not 
seriously considered. More specifically, the current ISO 
9000 standards is linked with the EFQM model criteria as 
follows: 


(a) Leadership: 

Management must demonstrate 
commitment towards the implementation 
of a quality management system. Also 
management is responsible for setting the 
organisation's Quality Policy and Quality 
Objectives. In the case of the EFQM 
Model, leaders must inspire support and 
promote a culture of Total Quality 
Management, which has a wider scope 
than Quality Assurance. 


ee 
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(b) 


(c) 


Policy and Strategy: 

Although in ISO 9000 Quality Systems 
the policies are more focussed on quality 
rather than more open as in the EFQM 
Model, the method of developing. 
communicating and updating is similar to 
the EFQM stated criteria, provided the 
leadership is fully committed to this 
process. 


People management: 
The ISO 9000 standard, specifies the need 


for competent people whose training © 


needs are defined, reviewed and satisfied, 
whereas the EFQM model goes much 
deeper assessing the involvement, 
empowerment and recognition, their 
agreement on targets and performance, 
the caring for people and the need for an 
effective dialogue. 


SS 


Policy 
x 
RAS WS 


= 
= 


Enablers 


Fig 2: The ISO9000 / EFQM connectivity 


effective use for the information collected 
in corrective/preventive action activities. 
The EFQM Model calls for the provision 
of results relating to the customers” 
perception on one hand and on the 
customers perception on the organisation. 


(g) People satisfaction: 
The EFQM Model asks for the provision 
of information regarding the expectation 
of its employees and their satisfaction. 


(h) Impact on society: 
The EFQM Model calls for the existence 
of evidence regarding the satisfaction of 
the needs and expectation of society 
relating to the perception of the 
organisation and the measurement of the 
organisation's impact on society. 


Results 


(The gray colour denotes ISO 9000) 
(and EFQM Model connectivity ) 


(d) Resources: 
The ISO 9000 Standard calls for 
resources to be identified and be provided @ Business results: 
for in a generalised statement whereas the While in the ISO 9000 Quality Systems 
EFQM Model is specific on which the monitoring and control focuss on the 
resources have to be effectively managed process management side, in the EFQM 
and the method by which they are Model the performance of the whole 
managed. Such resources are financial, organisation has to be demonstrated and 
information, supplier relationships, assets, compared against competitors or best in 
technology and intellectual property. class organisations. 
(e) Processes: The ISO 9000 / EFQM connection is 
An effective ISO 9000 Quality System shown in figure 2. 
could be able to meet the EFQM 
requirements, always depending on the In spite of the above analysis of comparison between the 
commitment of management. The EFQM current ISO 9000 Quality System Standard and the EFQM 
Model requires more insight into the Model, the preliminary results of the Lamaca Area 
target setting and also the need for using Customer Services from the Self Assessment Questionnaire, 
innovation and creativity. show that the department has no serious deviations from the 
EFQM Model. The main reason for this, is that duri 
ea) Customer satisfaction: development and implementation of Spe wi 
The ISO 9000 standard calls for the system, the department went well beyond the strict 
proper handling of customer complaints compliance with the requirements of the standard. It went 
on one hand and on the other hand. the on to take into consideration a number of the TQM 
se — 
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principles including the need for continuous improvement, 
the involvement and communication with personnel, the 
setting of quality targets and criteria and monitoring of their 
achievement and the customer focus orientation. 


For the process of self assessment, the Larnaca Area 
Customer Services is utilising the workshop approach. This 
approach requires the involvement of the management team 
for data gathering and review. There are usually five 
elements in this process, namely training, data gathering, 
scoring workshop, agreeing improvements and reviewing 
progress. 


The workshops are currently at the scoring phase during 
which each one of the 32 criterion parts is scored using a 
structured approach. Additionally, they discuss and agree 
on the relevant strengths and weaknesess of their existing 
processes and ideas for improvement. 


As a vehicle for further analysis and implementation of the 
improvement, the department is planning to use the existing 
system of continuous improvement, which is currently 
embedded in the quality management system. This system 
is based on a central function (central complaints and 
corrective action function) which is centrally managing all 
instances of customer complaints, internal problems with 
the aim of designing and implementing the necessary 
corrective action and to make sure that the corrective action 
implemented is effective. 


The new edition ISO 9001:.20006 


The new version of Quality System Standards is going to 
come into effect from 1 January 2001. This new version is 
addressing the company's quality system in completely new 
setting, both with respect to concept and structure. It 
follows in the footsteps of the EFQM Model. Its conceptual 
model is structured on the process approach "Plan, Do, 
Check, Act" and based on the Total Quality Management 
principles discussed earlier. 


The focus is shifted towards customer satisfaction, people’s 
satisfaction, measurement, analysis and improvement. 
Companies will be asked to change their existing ISO 9000: 
1994 Quality Management Systems to meet the 
requirements of ISO 9001: 2000 within a specified period 
of time’. 


It is believed that this transformation will assist the EFQM 
process greatly because of the better compatibility between 
the two. Additionally, ISO 9001: 2000 is an auditable 
standard in which the organisation must provide evidence 
of compliance. The evidence required will also be used as 
evidence towards the implementation of the EFQM model. 


Examples of such evidence required are: 
e Identification and fulfillment of customers’ needs 
e Existence of measurable objectives for each function 


e Communication between various levels of the 
organisation 

e Provision of necessary resources and assignment of 
personnel 

e Process verification and validation activities 

e Measurement activities to achieve improvement 
including customer satisfaction 

The major difference is that ISO 9001:2000 is more 

focussed on the product or service while the EFQM Model 

has a wider scope encompassing wider activities of the 

organisation. 


CYTA’s Larnaca Area Customer Services is currently 
going through a crucial phase in which it is operating its 
newly implemented ISO 9002:1994 quality management 
system, it is going through Self Assessment for the EFQM 
Model and additionally it will have to transfer its quality 
management system , in the near future to meet the 
requirements of the new version ISO 9001:2000. It is a 
process in which as noted earlier, each element or aspect 
contributes towards the success of the other. For example, 
because of the EFQM process the current quality 
management system will have no difficulty in its transfer to 
the new ISO 9001:2000 version. 


Additionally it is a process in which both management 
commitment and employee involvement and motivation are 
continually tested. The current signs show that there is 
enough of commitment and motivation to guide the 
department to both the certification to ISO 9001:2000 and 
the European Quality Award. It has to be noted that 
achieving the latter is a much longer process than achieving 
quality management systems certification. 


This experiment is one the first of its kind for 
telecommunication services in the Mediterranean region. 
The results will be not only closely watched by the rest of 
CYTA but also by the regional telecommunications 
authorities and other similar utility services providers. 
Further papers will be published that will report on the 
results and the progress of the project. 
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Towards QoS specification for distributed 
virtual environments 


Maja MatijaSevi¢, Denis Gra¢anin, Kimon P. Valavanis, Ignac Lovrek 


Abstract— Understanding the Quality of Service (QoS) re- 
quirements of multimedia-rich and highly interactive applica- 
tions such as distributed virtual environments (DVEs) is an 
open research issue. As a contribution to the ongoing research, 
this paper proposes a set of high level QoS characteristics for 
DVEs, namely, interactivity, immersion, and plausibility. Each 
QoS characteristic is further related to quantitative and quali- 
tative application and communication level parameters. The 
proposed approach is illustrated by an example. 


Index Terms—virtual reality, quality of service, Internet 


I. INTRODUCTION 


The research described in this paper has been motivated by 
the challenge to provide a set of Quality of Service (QoS) 
characteristics for distributed virtual environments (DVEs). 
Virtual malls, collaborative design and prototyping, military 
training, and telemedicine are only few examples of DVEs. 
Considering the attractiveness and variety of such applica- 
tions there is a need for better understanding of their re- 
quirements in terms of QoS. 

ISO/IEC and ITU-T define QoS as “the collective effect of 
service performances which determine the degree of satis- 
faction of a user of the service” [1]. For DVEs, the user of 
the service is human, and the service performance is evalu- 
ated in terms of human perception, i.e. subjective and 
qualitative, rather than objective and quantifiable terms. 
The notion of QoS at the user/application level is expressed 
differently from that at the communication level. Due to a 
variety of application areas, as well as design approaches, a 
consistent way of mapping or propagating the user require- 
ments to the communication level is an open research issue. 
In this paper, a general method is proposed, that is based on 
a minimalistic hierarchically organized structure of QoS 
characteristics for evaluation of various DVEs. The paper is 
organized as follows: Section II describes the proposed set 
of user/application level QoS characteristics to represent the 
highest level of the QoS hierarchy, and relates them to 
lower-level characteristics. Section III illustrates the pro- 
posed approach using as an example a DVE for teleopera- 
tion of a mobile robot. Section IV concludes the paper. 


II. USER/APPLICATION LEVEL QOS CHARACTERISTICS 


When evaluating the service performance in terms of hu- 
man perception, it is practically impossible to go beyond 
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the rule-of-thumb approach. For example, “well-known” 
ranges of audible sound (20 Hz-25,000 Hz), and visible 
light (390 nm—800 nm) are actually only well-measured and 
averaged. Similarly, smooth motion for motion video is, in 
average, observed at 25-30 fps (frame / second), but the 
same effect for animation may be achieved at frame rates as 
low as 15 fps. Research in audio and visual media temporal 
composition shows that what users consider to be “accept- 
able quality” falls into a variable range of values [2,3]. 

The empirical approach, using techniques such as question- 
naires, interviews, and physiological monitoring, is consid- 
ered the most useful and reliable for obtaining user’s 
evaluation. The method of quantifying user evaluation for 
each variable uses a k-point Likert scale. This scale, used in 
behavioral research, provides points for rating a statement. 
A good example of such an approach is the Mean Opinion 
Score, a universally accepted method for estimating the 
overall acceptability of voice communications systems. The 
users mark their evaluation between the value (1) for the 
lowest, and the value (4) for the highest agreement or 
evaluation. For example, a 5-point Likert scale may be 
given to evaluate the following statement: 


I felt immersed in the virtual world. 
Not ata ae) 5 5 Completely 


In this example, rating 4 is interpreted as immersion of 4/5. 
To relate the service performance (as observed by the user) 
with measurable system performance, it is necessary to de- 
fine a set of characteristics at a user level which may subse- 
quently be mapped to measurable application-related quan- 
tities. A QoS characteristic is defined [4] as “a quantifiable 
aspect of QoS, which is defined independently of the means 
by which it is represented or controlled”. The following 

QoS characteristics are proposed to represent the highest 

level of the QoS hierarchy: 

e Interactivity: It is defined as the user’s evaluation of 
the ability to interact with objects and other users in a 
DVE, as well as extent of such interaction. 

e Immersion: It is defined [5] as the presentation quality 
of (multi)sensory information that a user perceives as a 
3D synthetic environment. Thus, immersion may be 
quantified as a combination of visual, auditory, and 
haptic immersion. 

e Plausibility: It is defined as the user’s acceptance of 
events experienced in the virtual world as apparently 
reasonable and valid based on one’s experience. 


An evaluation of each of these three characteristics may be 
obtained empirically, in a study involving representative 
users, network configuration, and an application prototype. 
Our goal is to relate user evaluation with quantitative, 
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measurable application and communication parameters. 
Each highest level characteristic is quantified as a linear 
combination of quantities representing constituent charac- 
teristics. In the minimalistic approach, it is assumed that 
each constituent characteristic has an equal contribution to 
the high-level characteristic. To generalize, different weight 
factors may be assigned to different characteristics. Multi- 
ple options in a specific DVE may be evaluated separately. 


A. Interactivity 


In the context of any multi-user VE, interactivity is related 
to scope and extent of user’s actions in the VE. Thus, inter- 
activity is quantified as a linear combination of variables 
representing the sensory input support, navigation, and, 
user representation. 


1) Sensory input 
Sensory input support includes tracking the user and user’s 
actions in the real world and interpretation thereof in the 
virtual world. A choice of input device(s) and its use deter- 
mine the quality of sensory support. Table 1 gives the pro- 
posed ranking of tracking and command input techniques. 
The high-end of the scale for sensory input includes track- 
ing, and gesture- and speech recognition. At the low-end, 
minimal requirements are 2D-pointer (mouse) and keyboard 
input. Other variables may be chosen for gesture and speech 
recognition evaluation, for example, the number of recog- 
nized gestures and words (vocabulary), etc. 


Table 1 Proposed ranking for sensory input support 


dash-board 
=< 
tracking 
i 


3 

Response time (lag) is of crucial importance for interactiv- 
ity. The requirements for response time depend in general 
on the type of display (auditory, visual, haptic), e.g., the re- 
quired response time for visual display is 100-300 ms. 


5 


2) Navigation 


The perception of movement of the viewer is called navi- 
gation. Depending on the virtual world topic, different 
navigation styles may be available: walking, flying, 
teleportation, examination, etc. To support user interaction 
in a multi-user environment, some techniques may be more 
appropriate than others. Table 2 shows the proposed rank- 
ing of navigation techniques. At the lowest end of the scale 
is the locked camera view, where the user is only a passive 
observer on a “guided tour” of the virtual world. Next is 
teleportation, which allows more choices, but remains 
bound to predetermined, named locations in the VE. Ex- 
amine-style allows a user to observe the virtual world as a 
single object from different points of view. Although ex- 
amination and teleportation may assist the user to quickly 


get acquainted with the virtual world, they are not appropri- 
ate for other types of interaction. More realistic navigation 
techniques include flying and walking, the applicability of 
which depends on VE theme, for example, flying (6 DOF, 
collision detection) for flight simulation. Walk paradigm 
with terrain following and collision detection is the most 
natural way of exploring a physics-based environment. 


Table 2 Proposed ranking of navigation techniques 


| 1 | Locked camera (none) _| 
Examine 


3) User representation 
A user’s representation in the virtual world is referred to as 
an avatar. In a multi-user DVE that facilitates user interac- 
tion, it is expected that a user may chose an avatar, as well 
as to customize it according to personal preferences. An 
evaluation of an avatar includes the avatar’s appearance, as 
well as mapping of user input to avatar’s behavior. From 
the user’s point of view, this is an ability to articulate the 
virtual body parts and interact with the environment. Avatar 
interactions include user—object interactions, and user—user 
interactions. The proposed ranking of avatar quality is 
given in Table 3. 


Table 3 Proposed ranking of avatar quality 


Appearance / Articulation 


none (viewer 


2 | icon (pointer) 


5 | speech, facial expres- 
sions 


B. Immersion 


Immersion is quantified through quality of presentation, or 
output, of (multi)sensory information. The sensory output, 
in general, is a result of the simulation corresponding to 
each type of output (visual, auditory, haptic). It is presented 
through an appropriate display, ranging from basic visual 
and auditory displays, to haptic displays and smell emitters. 
For the purpose of this paper, immersion is quantified in 
terms of visual and auditory quality and their synchronized 
presentation. The user evaluation of synchronization corre- 
sponds to perception of delay and jitter (some experimen- 
tally established values may be found in [6]). 


Avatar interaction 
[1 | none (viewer) | 
2 | object selection and 

manipulation 
collaborative 
social interaction 


5 | avatar—avatar di- 
rect interaction 


I) Visual quality 
Visual quality is defined as the user’s assessment of quality 
of visible elements in the virtual world. This includes all 
media presented through the visual display, such as text, 
image, 2D and 3D graphics, animation, video, etc., for 
which a set of quantitative characteristics may be defined. 
In this paper we assume the raster-scanned graphic display 
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(screen or HMD) as visual output device. Quantitative 

characteristics of visual quality include: 

e Color: The number of bits allocated for pixel color. 

e Resolution: The product of the number of pixels on the 
horizontal axis and the number on the vertical axis of a 
viewing area (window) on a computer display screen. 

e Level of detail (LOD): The amount of detail or com- 
plexity which is displayed at any particular time for a 
geometric object . 

e Field of view (FOV): The part of space seen without 
movement of eyes or head. In the real world, typical 
values are estimated at 180°-200° for horizontal (bin- 
ocular overlap 120°), and 120°-135° for vertical FOV. 

e Frame rate: The number of frames displayed per time 
unit, quantified as frame/second (fps). 

e Number of visual channels: The number of separate in- 
coming streams from one or more visual information 
sources. 

Questions related to scene quality should address color, 

resolution, LOD, FOV, and the number of channels; and 

those related to motion quality should address the frame 
rate. The description of the above characteristics for the 

purpose of user evaluation need not use technical terms. A 

user-friendly approach, such as “query-by-example” [7] 

proposed for use in distributed multimedia applications may 

be appropriate. This approach is guided by the idea to allow 
the user express his/her requirements using a set of prede- 
fined settings as an example. Table 4 shows visual quality 
parameters, with typical limits. 


Table 4 Proposed visual quality parameters 


8 bit/pixel, 256 color 352x240 
24 bit/pixel, “true color” 1024x768 


120 


Other parameters of visual quality, such as textures and 
rendering quality may be quantified in terms of file size, 
applied algorithms, and graphic processing capacity. The 
achievable scene quality may be observed and described at 
the user level as shown in Table 5. 


Table 5 Proposed ranking of scene quality 


Scale_| __Userview | Algorithm 
| 1 [wireframe | no filling 
f 


2 non-uniform shading 
algorithm 
uniform shading 
Phong, Gouraud 


= 
Sa | 
ed 
Bate 


illumination, shadow lighting equations 


alpha-blending 
2) Auditory quality 


Auditory quality is defined as the user’s assessment of all 


i, “photorealistic” 


no spatialization 


2 beep (single frequency) stereo (2 channels / 
source, phase shift 


audible effects in the virtual world being presented at the 

user interface. In user terms, auditory quality is evaluated in 

terms of sound quality and sound spatialization. Quantita- 
tive characteristics of auditory quality include: 

e Audio sampling rate: The sampling frequency, in Hz. 

e Sample size: The size of an audio sample, in bytes. 

e Number of audio channels: The number of separate in- 
coming streams from one or more audio information 
sources. 

The combination of the above parameters may again be 

presented for evaluation in non-technical terms, using 

questions about quality of each sound, and its spatial char- 
acteristics such as perceived direction. Table 6 shows pro- 
posed ranking of auditory quality. 


Table 6 Proposed ranking of auditory quality 


phone quality 
(8000 Hz, 8 bit/sample 
CD quality (44100 Hz, 


spatialized sound 
sound simulation 


stereo, 16 bit/sample 


C. Plausibility 


Plausibility combines the effects of interactivity and im- 
mersion. It relates the user’s association of the performed 
action with its observed effect in the virtual world. One is- 
sue is the cause-and-effect relationship, or causality, and 
the other involves laws of physics in general. For example, 
a VE may be both interactive and immersive, however, if 
dropped objects do not fall, or avatars pass through solid 
objects, it is not plausible in terms of the above definition. 
Thus, providing a plausible feedback depends on simulation 
capabilities. 

The problem with causality is non-trivial and a direct con- 
sequence of users distribution, ordering, and end-to-end 
delay at the communication level. In DVE, from a particu- 
lar users point of view, the state of the virtual world is actu- 
ally a “view in the past”, since each remote object is up- 
dated locally only after an update has been received over 
the network. This delay may vary for each pair of users, 
causing a different perception of the current state. This, in 
combination with user’s actions that change the VE, may 
lead to undesired effects and a loss of consistency. 


III. EXAMPLE: A TELEROBOTICS APPLICATION 


An example DVE developed for teleoperation of a mobile 
robot [8,9] over the Internet is shown in Figure 1. It has 
been implemented using the DIS-Java-VRML [10] and IP 
multicast at the network level. Here, we address interactiv- 
ity, immersion, and plausibility, and comment on commu- 
nication level QoS characteristics. 


1) Interactivity 
Interactivity has been defined as a combination of user 
evaluations for sensory support, navigation, and user repre- 
sentation. The interactivity parameters are given in Table 7. 
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Figure 1 Snapshot of the telerobotics DVE 


In this DVE, users do not have self-representing avatars, 
but when the operator moves the robot, the robot acts as the 
operator’s avatar. 


Table 7 Interactivity parameters 


1) Poi input Or 
Navigation predefined viewpoints 
Examine ge capabill 


technique 
__|___browserdefault__| 


articulation 
sag 92 eel 
tion 


2) Immersion 
Immersion has been defined as a combination of visual and 
auditory quality. Auditory component has not been imple- 
mented. Visual parameters are shown in Table 8. 


virtual controls 
control panel 
virtual controls 


User 
ue | 


Table 8 Visual quality parameters 


al Ra EN I Be te | Chan 


[16 bitipixel | 640x480 | 44.97° | high | 25 fps | 1 | 


The frame rate display, incorporated in user interface, pro- 
vides a real-time information about current frame rate and 
its variation in different navigation modes. For fast walk 
and examine style navigation, frame rates of over 30 fps 
have been recorded, and smooth motion conditions have 
been satisfied. The virtual world model is relatively small 
(total of 1779 polygons, where the robot object is the most 
complex object consisting of 983 polygons). Table 9 pres- 
ents the scene quality parameters. 


Table 9 Scene quality parameters 


Algorithm 


| User view 


- smooth look uniform shading 
texture texture mapping 
illumination, shadow lighting equations 


3) Plausibility 
Plausibility has been defined as a combination of causality 
and physics. The physical simulation has been implemented 
in the mobile robot Nomad 200 simulator within Cognos 
Host Software Development Environment. Causality prob- 
lems due to latency and loss have not been observed in the 
testbed. This was an expected result in a LAN, where an 
measured average Round Trip Time (RTT) was in range of 
one millisecond, and typical loss was zero. Typical laten- 
cies may be estimated from the RTT and simulation inter- 
val. Although much greater latencies have been measured 
over the wireless connection, the overall latency was still 
small enough not to affect plausibility. 


4) Communication level QoS measurements 
Throughput and delay have also been measured with the 
objective to test the developed VE. Several single-user and 
multi-user scenarios have been studied and reported in [9]. 


ITV. SUMMARY AND CONCLUSIONS 


Our contribution to better understanding the QoS require- 
ments of DVEs, as a first step towards specification, is a set 
of QoS characteristics for DVEs, namely, interactivity, im- 
mersion, and plausibility. These user-oriented, subjective 
QoS characteristics are related to multiple quantitative and 
qualitative application and communication parameters for 
evaluation of DVEs. An example using a telerobotics DVE 
demonstrates the viability of the proposed approach. 
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Fault Analysis and Prediction in 
Telecommunication Access Network 


Damir Medved’, Koraljka Brlas' and Davor Sarié”, Member, IEEE 


Afdstract- Technological innovations and growing consumer 
demand have led to a variety of design and exploitation 
problems in telecommunication networks. In particular, local 
access networks have received a lot of attention, since they 
account for approximately 60% of total investments in 
communication facilities. Hence even marginal improvements in 
the access network design and maintenance can lead to 
substantial savings. 

Faults in telecommunication network have direct impact on its 
availability and maintenance costs, so their quick elimination, 
prevention and removal of causes that generated them, is of 
special interest. Possibility to analyse spatial distribution of 
faults and to predict places where future errors may arise can 
significantly help telecom operators who are in charge to detect 
and repair such problems. This data is also crucial for future 
planning and design of telecommunication networks as it can 
help to avoid problematic areas or use additional measures for 
protection of telecommunication cables and equipment. 
Existing statistical data (fault data and foreign factors data like 
precipitation, high-voltage installations etc.) is analysed and 
then a model of faults in telecommunication network is created 
using Bayesian methodology. The whole procedure of 
acquisition and interpretation of data is described, and an 
application called FATAN (Fault Analysis in Telecom Access 
Network) that aggregates all procedures is created. 

Achieved result gives us possibility to successfully determine 
locations where is likely that an error will arise. 


I. INTRODUCTION 


In past primary goal for most telecom operators in 
developing countries like Croatia was to reach all potential 
customers with telecom infrastructure, and do it fast. 
Shortage of adequate funding for replacing all older 
networks, leaded to weary heterogeneous situation (cables 
old up to 40 years in urban areas, cable insulations from 
paper to PVC, older networks with average lengths of 2,5 — 4 
km, newer networks up to | km, etc.). Maintaining such a 
network is quite complex problem; therefore an adequate 
documentation and fault analysis system is absolutely 
necessary. 

Today hottest question among telecom operators is QoS 
(Quality of Service) and how to satisfy customers and 


; Croatian Telecom 
Telecommunication Center Rijeka 
E. Baréiéa 6, HR-51000 Rijeka, Croatia 
_ e-mail: {damir_medved, koraljke brias}@ht br 
~ Crostian Telecom 
Non-voice Services Division, 
DraSkoviceva 26, HR-10000 Zagreb. Croatia 
e-mail: davor_saric@ht hr 


maximize profits. If you already have building blocks 
(applications, databases) such a described system, the logical 
next question is can you use this existing data to predict 
locations of future faults (with acceptable margins of 
probability) and prevent or minimize network downtime. 
The main purpose of this paper is not to discuss the detailed 
modelling issues of faults in local access network, rather our 
emphasis is to show design and implementation of the 
procedures in the fault tracking and analysis system that may 
become a standard building block of the company decision 
support system. 


Il. PROBLEM DESCRIPTION 


When problem was initially defined the first assumption was 
that the simplified model of telecom network would be used. 
There are many reasons for that; real world telecom network 
consists of huge number of network elements, which are 
constructed from many smaller parts and so forth. Obviously, 
description of all that elements (and all possible locations 
and sources of faults) is practically impossible. The relatively 
small Rijeka area has more than 140 000 subscribers 
connected with more than 800 cables to 200 switches and 
that leads us to few million possible fault locations. Next 
problem is that actual system for fault tracking does not 
acquire, in many cases, a lot of details (for example exact 
location of fault (coordinates), description of fault is 
somewhat arbitrary, technical documentation is not complete 
or actual for some networks, etc.). 

The main sources of data along with mentioned fault- 
tracking system are Telecom GIS system and meteorological 
data. 

GIS database currently holds site and schematic cable 
diagrams, duct and manhole drawings as well as OSP (DP, 
CCP) and switching equipment location and capacity. 
Georeferenced building and equipment locations with fault 
data attached are available for analysis. Up to now primary 
purpose of the GIS in TKC Rijeka was to support network 
operations and maintenance. Practically all departments use 
this database, primarily for real world visualization of 
telecom infrastructure, localization of demand for new lines, 
equipment and cable failure troubleshooting. 

When the GIS database was created, the primary goal was to 
establish connection between alphanumerical data in SQL 
DBMS and georeferenced information contained in 
numerous AutoCAD schematic and site drawings. In the 
future, special attention will be paid to adopt data acquisition 
and preparation procedures that will emphasize the 
importance of collecting new details in addition to O&M 
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related data, and this is the direct consequence of the 
development and implementation of described system. 
Along with telecom data GIS database contains data from 
local power company (high voltage installations, 
transformers), water installations, flood data etc. External 
meteorological database contains detailed data (daily 
acquisition) from many locations with temperature and 
precipitation. 
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Fig. 1. General model of fault analysis in telecommunication access network 


Generally all process of analysis can be dissolved in few 

steps (Fig. 1.): 

1. From GIS system extract locations that are by definition 
more threatened (near high voltage equipment, high 
areas with greater possibility for atmosphere discharges, 
flood areas etc.) 

2. Perform an statistical analysis of related fault data from 
fault tracking system 

3. Perform statistical analysis of meteorological data and 
other external factors 

4. Check if there is statistically significant correlation 

between points 1, 2 and 3 

Define graphical probability model (Bayesian network) 

6. And finally, extract and visualize menaced objects using 
GIS application 


oa 


Ill. TOOLS 


As mentioned before first step in process of data analysis is 
data preparation and transformation in form suitable for 
further work. In our case, GIS database is build using 
standard AutoCAD Map 3.0 and ESRI ArcView 3.1 tools. 
Statistical and belief network analysis is done using Netica 
Application 1.12 and Netica Programmer’s Library, also 
known as the “Netica API” (Application Program Interface) 
from Norsys. Netica is a versatile, fast, user-friendly program 
that can be used to find patterns in data, create diagrams 
encoding knowledge or representing decision problems, use 
these to answer queries and find optimal decisions, and 
create probabilistic expert systems. 


IV. BAYESIAN NETWORKS 


A Bayesian network is graphical model for probabilistic 
relationships among a set of variables. Few years ago 
researches have developed methods for learning Bayesian 


networks from data. These techniques are new and still 

evolving, but they are quite effective for some data-analysis 

problems. There are four main reasons why Bayesian 

methodology is useful in fault prediction system [1]: 

1. Bayesian networks can readily handle incomplete data 
sets. For example, in classification or regression problem 
where two of the explanatory or input variables are 
strongly anti-correlated. This correlation is not a 
problem for standard supervised learning techniques, 
provided all imputes are measured in every case. When 
one of the inputs is not observed, most models will 
produce inaccurate prediction, because they do not 
encode correlation between the input variables. Bayesian 
networks offer a natural way to encode such a 
dependencies. 

2. Bayesian networks allow learning about casual 
relationships. This is important for at least two reasons. 
The process is useful when we are trying to gain 
understanding about a problem domain, for example, 
during exploratory data analysis. In addition, knowledge 
of casual relationships allows us to make predictions in 
the presence of interventions. 

3. Bayesian networks in conjunction with Bayesian 
statistical techniques facilitate the combination of 
domain knowledge and data. Bayesian networks have a 
causal semantics that makes the encoding of casual prior 
knowledge particularly straightforward, and also encode 
the strength of casual relationships with probabilities as 
consequence; prior knowledge and data can be 
efficiently combined. 

4. Bayesian methods in conjunction with Bayesian 
networks and other types of models offers an efficient 
and principled approach for avoiding the over fitting of 
data. This means that models can be “smoothed” in way 
that all available data can be use d for training. 


A. Belief network learning 


Belief network learning [2] is the automatic process of 
determining a suitable belief network with given data in the 
form of cases (Fig. 2). Each case represents an example, 
event, object or situation in the world (presumably that exists 
or has occurred), and the case supplies values for a set of 
variables, which describe the event, object, etc. Each variable 
becomes a node in the learned network (unless you want to 
ignore some of them) and the various values it takes on 
become the state of that node. Some cases may not have 
values for some variables that other cases do, which are 
known as missing data (shown as * in table). 

The learned network can be used to analyze a new case 
drawn from a population similar to the cases used for 
learning. Typically the values for some variables of the new 
case will be known. These are entered as findings, and then 
probabilistic inference is done to determine beliefs for the 
values of the rest of the variables for that case. If the network 
links correspond to a causal structure, and the query nodes 
are ancestors of the findings nodes, then you could say that 
the network has learned to do diagnosis. If the query nodes 
are descendants, then the network has learned to do 
prediction, and if the query node corresponds to a “class” 
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variable, then the network has learned to do classification. Of 
course the same network could do all three, even at the same 
time. 


Fig. 2. Case file for network learning 


The belief network-learning task has traditionally been 
divided into two parts: structure learning and parameter 
learning. Structure learning determines the dependence and 
independence of variables and suggests a direction of 
causation; in other words, the placement of the links in the 
network. Parameter learning determines the conditional 
probability relationship at each node, given the link 
structures and the data. Currently Netica only does parameter 
learning (i.e., you link up the nodes before learning begins). 
Netica uses a true Bayesian learning algorithm. While 
learning, Netica assumes independence between each of the 
conditional probabilities of a node’s relation to its parents. 
This works very well when there is lots of data (or nodes 
have few parents), but can result in under-confidence and 
poor generalization otherwise. Netica may not have seen 
some parent configuration, so the learned probability 
distribution is uniform for it, but Netica might do better if it 
assumed that parent configuration behaved similarly to other 
similar parent configurations (i.e., assumed a little more 
independence between variables, and a little less 
independence between conditiona! probabilities) 


V. COMPUTATIONAL RESULTS 


In analysis of telecom access network fault data for Rijeka 
region more than 65000 cases where considered (from years 
1997 to 2000). This is relatively short period of time but it 
was impossible to collect all necessary data for years before 
1997. As new data is constantly acquired the model with self 
enhance in future, and this is probably one of the most 
valuable achievements of this approach. 

Using Netica [4], a belief network for faults in 
telecommunication networks is constructed which captures 
believed relations (which may be uncertain, stochastic, or 
imprecise) between a set of variables which describes 
simplified model of telecommunication network (Fig. 3.). 
When the belief network is constructed, one node is used for 
each scalar variable, which may be discrete, continuous, or 
proposition (true/false). 
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Fig. 3. Belief network for telecom fault prediction 


The nodes are then connected up with directed links. If there 
is a link from node A to node B, then node A is sometimes 
called the parent, and node B the child (of course, B could be 
the parent of another node). Usually a link from node A to 
node B indicates that A causes B, that A partially causes or 
predisposes B, that B is an imperfect observation of A, that A 
and B are functionally related, or that A and B are 
statistically correlated. 

Finally, probabilistic relations are provided for each node, 
which express the probabilities of that node taking on each of 
its values, conditioned on the values of its parent nodes. 
Some nodes may have a deterministic relation, which means 
that the value of the node is given as a direct function of the 
parent node values. 
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Fig. 4. Example of fault analysis for variable electric discharging (greater 
spots define most problematic locations) — analysis show that most 
problematic areas are near by large metallic structures (high voltage 

transformers, oil raffinery, etc.) 


After the belief network is constructed, it may be applied to a 
particular case. For each variable you know the value of, you 
enter that value into its node as a finding (also known as 
“evidence’”’). Then Netica does probabilistic inference to find 
beliefs for all the other variables. Suppose one of the nodes 
corresponds to the variable “Temperature”, and it can take on 
the values cold, medium and hot. Then an example belief for 
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temperature could be: [cold - 0.1, medium - 0.6, hot - 0.3], 
indicating the subjective probabilities that the temperature is 
cold, medium or hot. Changing values (temperature, 
precipitation etc.) of variable or variables we can obtain 
probabilities of fault in given telecommunication network 
and after visualization with GIS system we can easily locate 
problematic areas (Fig. 4.) 


VI. OVERALL SYSTEM DESIGN 


The FATAN system is implemented in Microsoft Visual 
Basic 6.0 and ESRI MapObjects 2.0. It has user-friendly 
interface, and it is possible to choose various types of 
analysis, choose area and telecommunication equipment etc. 
As sample (Fig. 5), a module that allows changing variables 
for precipitation and temperature and immediately shows 
probabilities for faults in selected area, is given. 
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Fig. 5. Probability of fault when changing variables for temperature and 
precipitation 


VII. FURTHER WORK 


Greatest problem in fault prediction in telecommunication 
access networks is missing or incomplete data, so greater 
effort for modifying and enhancing existing applications for 
fault tracking and enforcing discipline in data acquisition will 
be primary goals in years to came. 

Tighter integration of FATAN with the GIS is a natural next 
step that will provide more efficient analysis environment. In 
addition to that, new forecasting modules will enhance 
overall capabilities of system. 


VIII. SUMMARY AND CONCLUSIONS 


The three pillars of the project; 

1. Implementation of an efficient and proven Bayesian 
methodology to problem of fault analysis in 
telecommunication networks, 


2. Usage of all available data stored in many different 
telecom internal and external databases, efficient 
conversion and aggregation of various data types and 
enabling fast problem modeling and visualization, 

3. Up-to-date fault analysis and simulations from the 
modern GIS application, 

has enabled a small team of professionals to achieve valuable 

results in a short available time, and show that is possible and 

economically rightly build such a system, all in desire for 
further QoS enhancement for our customers. 
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TTICN QUALITY METRICS 
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Abstract 


Testing is a major factor in the successful development of 
efficient and reliable telecommunication implementations. 
Time and resources for testing are limited for economic and 
competitive reasons; therefore improvement in the test 
specification procedure is of particular interest for the 
industry. 


The need for a unified approach in conformance testing, led 
to the publication of the Conformance Testing 
Methodology and Framework (CTMF) standard ISO/IEC 
9646. The Tree and Tabular Combined Notation (TTCN), is 
the standard language used to specify Abstract Test Suites 
(ATSs)[3]. Test Suites written in TTCN are contained in a 
Machine Processable (MP) format. 


It is widely accepted that the observed quality of TTCN test 
suites published by international committees (ETSI, ITU-T) 
is often regarded as poor. This is an indication of 
weaknesses in current Quality Assessment (QA) practices. 


In this paper an automatic quality assessment method for 
Abstract Test Suites is initially defined. Then based on the 
metrics derived for specific TTCN notation rules, total 
quality designation of an ATS is proposed. 


Criterion_1 


Criterion_n 


Notation 
Rules 


I. INTRODUCTION 


For the quality assessment of any kind of software, a 
stepped approach is required. The phases, which lead to the 
automatic quality assessment of an ATS are proposed 
below: 

e The interpretation of a quality assessment model 
consisting of quality factors and quality criteria. 

e The creation of a set of rules for the TTCN notation to 
assure that the previous quality factors and criteria are 
satisfied. 

e The creation of a series of static measurement, called 
metrics, which implement the TTCN notation rules. 

e The definition of specific quality limits for each metric 
based on the interpretation of the corresponding rule. 

e The generation of a quality assessment process, based 
on mathematical equations, extracted from metrics and 
relationships between the TTCN notation rules, quality 
factors and criteria. 


The arrangement of these steps, in a form of a quality 
assessment tree, is depicted in Figure 1. Each of these steps 
is described in detail within the following paragraphs. 


Total 
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Figure 1: Quality Assessment Tree 
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I]. THE QUALITY ASSESSMENT MODEL 


During the last decades many quality models for software 
quality assessment have been developed. Since the 
development of an ATS is similar to the standard software 
development a similar approach can be followed for the 
quality assessment of the ATSs. The most known and used 
quality assessment model is the one which was developed 
by McCall, Richard and Walter[11]. This quality model is 
based on a number of quality factors. 
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In order to make the evaluation process easier and more 
general the, quality factors spilt into a number of 
corresponding criteria. 


III. DEFINITION OF RULES FOR THE TTCN NOTATION 


The following step to the evaluation process in the 
definition of quality rules for the TTCN notation, which 
influence the quality criteria of McCall’s quality model. 
These rules can be grouped according to the elements that 
influence the quality of an Abstract Test Suite. The above 
defined groups will be referred from now on as GQR 
(Group of Quality Rules). The grouping of rules is very 
helpful to the assessment process, since each GQR affects 
one ore more quality criteria and there is no need to map 
each rule separately to the criteria. The only requirement is 
to specify the quality criteria which are influenced be each 
GQR. An example of a GQR is illustrated below 


Rules for the GQR TIMERS 


1. There must be a TIMEOUT event for every timer. 
here must be a CANCEL operation for every timer, or 
the CANCEL operation with the timer’s name omitted 
(all running timers become inactive). 


IV. TTCN METRICS DEFINITION 


The TTCN quality rules, in text format, can not be used in 
an automatic quality assessment process. Only a manual 
quality assessment process can be based on text. For the 
automatic process only numbers can be understood by the 
computer._Thus, the next. step to the automatic quality 
assessment process, is the extraction of metrics for the 
TTCN quality rules. Metrics are static measurements 
obtained without executing the ATS. 


For the quality assessment of ATSs, two types of metrics 
will are proposed. There are rules that can be translated into 
metrics, by measuring only one feature of the ATS. Based 
only on this measurement, an assessment for the quality of 
the ATS can be decided. That is the first type of metric. 
Besides these, other type of rules can not be translated into 
a metric, by measuring only one feature of the ATS. The 
metrics of these rules, is the products of the comparison of 
two or more features of the ATS. The outcome of these 
comparisons is usually an average number, which can lead 
to quality conclusions for the ATS. That is the second type 
of metric. 


For every GQR, a set of metrics is derived. The definition 
of a metric is explained in the following example. 


Name : NUCT (Number of UNCANCELED Timers) 


Description: The total number of uncanceled timers in the 
ATS. 


Application: This metric belongs to the GQR_TIMERS 
and concerns the proper use of timers in the ATS. It must 
be used during the implementation phase, in order to make 


sure that all the timers used in the ATS are canceled after 
their use. 


Extraction procedure: The total number of uncanceled 

timer in the ATS is: 

NUCT=(NOTM) — (NOTM_O)-—(NOCT). 

Where: 

e NOTM is the number of timers used in the ATS metric. 

e NOCT is the number of CANCEL timer statements in 
the ATS. 

e NOTM_O is the number of timers used before a plane 
CANCEL. 


Criteria: This metric influences the completeness and 
instrumentation of the ATS. 


Interpretation & Action: All timers must be canceled. A 
timer, which is not canceled, indicates a serious error. The 
value of this metric must be zero. 


V. DEFINITION OF LIMITS FOR THE TTCN METRICS 


The quality equations for the automatic quality assessment, 
as described in the following section, can not use the 
extracted metric’s values as they were produced by the 
previously described method. These values must be 
formatted to fit in to a specific range in order to allow a 
unified flow to the structure of the equations. The extraction 
of these values is based on the actual values of the metrics. 


A normalised procedure which in fact is a mapping between 
the actual values of the metrics and values within the range 
[0 , 1] is reqired, (If the actual value is not already within 
this range). This can be accomplished by defining specific 
limits for each metric mapped on a value between 0 and 1. 
The comparison of the actual value with specific limits 
leads to the extraction of the value that the metric has 
acquired in the quality equations. 


The metric limits can also be used for the quality 
designation of each metric separately. The quality 
assessment is the product of the comparison of the actual 
value of the metric with its quality limits. The total quality 
evaluation of an ATS can not provide detail information for 
the quality. The quality designation of each metric 
separately allows the detection of the strong and weak 
points of an ATS. Thus, for the quality designation and the 
extraction of the value that each metric has in the quality 
designation the following Table I was proposed. 


Table I. The proposed quality desigantions relative to the 
equation values 


Name of the metric 


Quality Limits 
Equations 

| ACCEPTED | MINI-MAXI | 1 

| COMMENT | MIN2~MAX2 | 08 | 

in aeay ae 


. 
. 


Designation 


INSPECT MIN3—MAX3 | 06 | 
TEST MIN4 — MAX4 


REWRITE MIN 5— MAXS5 
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VI. DEFINITION OF THE MATHEMATICAL EQUATIONS 


The automatic quality assessment process is based on the 
relationship between TTCN quality factors, criteria, GQRs 
and metrics, which are expressed in the form of 
mathematical equations. The following example explains 
the form of these equations and the relationships between 
them. 


Scenario of the example: 
The example model has three quality factors 


Each factor splits into a number of quality criteria 
The criteria are influenced by Group of Quality Rules 
The GQRs are translated into metrics 


The analysis of the first factor is illustrated in Figure 2, in a 
form of a quality assessment tree. 


Extraction of quality equations: 


Factor_1 =(F1_1 x Criterion_1) + (Fl_2 x Criterion_2) (1) 
Criterion_1 =(C1_1 x GQR_1) +(C1_2 x GQR 2) (2) 
Criterion _2 =(C2_1 x GQR_3)+(C2_2x GQR 4) (3) 
GQR_1 =(G1_1 x Metric_1) + (G1_2 x Metric_2) (4) 
GQR_2 = (G2_1 x Metric_3) + (G2_2 x Metric_4) (5) 
GQR_3 = (G3_1 x Metric_5) + (G3_1 x Metric_6) (6) 
GQR_4=(G4_1 x Metrics _7)+(G4_2x Metrics 8) (7) 


The equation components “Metric” are arithmetic values 
within the range [0,1]. The extraction of those values for 
each of the TTCN metrics was described in the previous 
paragraph. 

ANSE TMI Et Pose 2 oe eel. Ieeted oe OD eee? 
GPT G12 G28 Ty GZE2 BGS 01, Gs: G4) 1 G4i Oiare 
quality weights, raging between zero and one [0,1]. 


GQR_1 


GQR 2 


GQR 3 


GQR 4 


The total quality assessment can not be calculated as the 
mean value of the quality of each of the quality assessment 
components. The metrics, which a GQR translates into, do 
not influence the quality of this GQR in the same degree. In 
the same way the GQRs do not influence the quality of a 
quality criterion in the same degree. The quality criteria do 
not influence the quality of a quality factor in the same 
degree, and not all the quality factors have the same weight 
in the total quality assessment of an ATS. The weights used 
in the quality equations express these degrees of influence 
in the form of arithmetic values within the range [0,1]. 


The values of the weights are defined in a way, that the sum 
of all the weights in an equation equals one (1). Since the 
equation components “Metric” have values within the range 
[0,1] and the sum of all the quality weights for each 
equation is equal to one. The outcome of all the equations is 
also within the range [0,1]. 


Following the same procedure for the other two factors, the 
equation for the total quality assessment is presented in 


equation (8). 


TOTAL =(T 1 x Factor, 1) + (ie2x Factora2) ce 
+ (T_3 x Factor_3) (8) 


Where, T_1, T_2, T_3 are also quality weights but in this 
case all within the range [0,100]. 


The sum of the quality weights for the total quality 
assessment is equal to 100. Thus, the range of 
values for total quality assessment equation is [0, 100]. 


For the total quality assessment of the ATS, the outcome of 
equation (8) is compared with the values of the following 
tablell. 


Metric_1 


Metric _2 


Metric 3 


Metric 4 


Metric_5 


Metric_6 


Metric_7 


Metric 8 


Figure 2 : Quality assessment tree of Factor_1 
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Table II Comparison values for the total quality assessment 


ATS Quality 
Designation 


ACCEPTED 


of an ATS 


VH. CONCLUSIONS 


For the quality assessment of ATSs only EWOS 
({12],[13],[15]) has defined a manual method (as far as we 
know), based on checklists. The proposed automatic 
methodology is a step forward compared to the existing 
manual method, in which the user must first fill a big 
number of lists and then manually process the information. 
The automatic method defined in this paper is expected to 
solve the applicability problems of met in manual methods 
like the required time and effort as well as the possibility of 
human mistakes. 
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Cost Effective Deployment of Service Logic in 
Future Distributed Processing Environments 


Anastasia K. Kaltabani, Evangelia C.Tzifa, Panagiotis P. Demestichas, and Miltiades E. Anagnostou 


Abstract-- The success of service providers in the competitive 
communications market of the future will be determined by 
the efficiency with which the services will be developed, and 
the quality levels at which the services will be provided. A 
service creation environment (SCE) should provide the service 
developer with solutions to these problems. In this paper we 
present an advanced SCE focusing on the service creation 
practice, the support of standardised service architectures, 
through libraries of service components, the provision of QoS 
handling related facilities and a tool for the optimisation of the 
service logic deployment scheme supporting the deployment of 
component-based distributed applications. 


Index Terms-- Service Creation, TINA, DPE, Deployment 
Patterns, Optimisation Problems 


I. INTRODUCTION 


The success of telecommunications service providers in the 
demanding and free market of the future will depend on the 
rapid creation of the pertinent services as well as their cost- 
effective provision at adequate quality levels [1,2,3]. In 
principle, the main concern of all the players in the 
competitive environment will be to constantly monitor the 
user demand, and in response to create, promote and 
provide the desired services and service features. The 
challenges outlined above have brought to the foreground 
several new important research areas. Some of them are the 
definition of new business models, the specification of 
service architectures (SAs) [4,5,6,7,8,9], the development 
of service creation environments (SCEs), e.g., 
({10,11,12,13], and the exploitation of the advances made in 
the software technology area [14,15,16]. Some essential 
factors for success are the following. First, the efficiency 
with which services will be developed. Second, the Quality 
of Service (QoS) levels, in relation with the corresponding 
cost, at which the new services will be provided. Third, the 
efficiency with which the services will be operated 
(controlled, managed, etc.). In general terms, the scope of 
this paper is in line with the cost-effective QoS provision 
and the efficient service creation and operation objectives. 
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More specifically, it describes a facility of an SCE that is 
targeted to the optimisation of some of the factors that will 
determine the quality levels at which services will be 
provided. 


According to the SA concept, the logic of a service is 
attributed to a set of autonomous, co-operating components. 
Service components (SCs) interact through middleware 
functionality that runs over Distributed Processing 
Environment (DPE). Middleware functionality is assumed 
to abstract from the service code the distributive nature and 
the heterogeneity of the underlying networks as in the case 
of the CORBA (Common Object Request Broker 
Architecture) compliant platforms [14]. The 
Telecommunications Information Networking Architecture 
(TINA) will be used as a reference SA in the rest of our 
paper. However, our approach and results may be applied to 
other models as well. 


An SCE should provide the service developer with the 
means to address the essential challenges in service 
creation. To this end, an SCE may be generally structured 
in three blocks: The service logic development tools that 
provide the means (software technologies) for the analysis, 
design, implementation and validation of the service logic; 
The service component library that provide reusable 
components; The QoS related facilities which provide the 
means for the cost-effective provision of services at the 
appropriate quality levels. 


An important QoS dimension that should be addressed by 
the QoS related facilities is the speed with which the service 
logic accomplishes its tasks, and especially, those that are 
perceived by the users. An important factor that will 
influence this aspect is the arrangement of the service logic 
(components) in the target environment. As already stated, 
a service is made available to users through a DPE. This 
means that when users request the various service features 
the functionality of various components, which are 
deployed somewhere in the DPE, is invoked. The service 
nodes (SNs) are physical (network) elements of the DPE 
wherein SCs may be installed. Each SN has certain 
capabilities, e.g., the maximum load it can undertake while 
keeping the performance within the acceptable range. 
Moreover, cost factors can be associated with the fact that a 
SC is deployed at a SN. These facts yield that SCs should 
be deployed in carefully selected SNs, taking into account 
the entailed cost and the QoS related criteria. Therefore, an 
optimisation problem is required so as to determine the SNs 
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wherein SCs should be deployed, and a tool is required as 
part of an SCE, for assisting the service developer in 
making the appropriate deployment pattern. The 
presentation of such a problem and respective tool is the 
subject of this paper. 


The rest of this document is structured as follows: In 
Section II, the elements of TINA SA framework that are 
related to our approach are revisited. Section II provides 
the problem description by giving its high-level 
representation and a formal description. Thereafter, in 
Section IV the mathematical formulation of the problem is 
given while in Section V a problem solution in terms of 
efficient algorithms is proposed. Finally, in Section VI 
some concluding remarks are made. 


IJ. ELEMENTS OF THE TINA SERVICE ARCHITECTURE 


A. TINA Session Concept 


A fundamental TINA concept is that of the session [3,5]. It 
was introduced for depicting the temporary relationship 
among a group of objects that are assigned to collectively 
fulfil a task for a period of time. Five session types have 
been defined, namely access, user, service, communication, 
and connectivity. The paper will be mainly concerned with 
the service, user, and communication sessions. 


The service session provides the context, in which the core 
of service execution takes place. Moreover, it has the global 
information regarding the participants and the logical 
connectivity, while each user session contains the subset of 
the information that refers to a specific participant. Finally, 
the communication session provides the means of 
controlling (establishing, modifying, releasing, etc.) the 
underlying transport network connections and resources. 


B. TINA Service Components 


In the rest of this sub-section we introduce the relevant to 
our study TINA concepts. In specific, we group the relevant 
to our approach TINA service components in respective 
sets according to the session they belong to. In our scenario, 
the components that are related to the terminal are grouped 
in a set, called Terminal (user site) Related Components 
(TRC). Furthermore, we have two more sets; the user 
session components and the service session components 
called User Session Related Components (USRC) and 
Service Session Related Components (SSRC), respectively. 


The Service Session Related Components (SSRC) in the 
retailer domain are the Service Session Manager (SSM), 
and the Service Factory (SF). Briefly, the SF is responsible 
for the creation of the SSM and the USM. The SSM is the 
component that has a global view of the service session. In 
summary, it keeps track of the status of the users 
(participants) in a service session and also the status of the 


logical connections among the different users. The User 
Session Related Component is the User Session Manager 
(USM). The USM represents a user in a user session. The 
information that is comprised is similar to the one included 
in the SSM but limited to the specific user that it represents. 
In the terminal the user is confronted with the service 
session User Application (ss-UAP). 


III. PROBLEM DESCRIPTION 


A. High-Level Problem Representation 


In this subsection we provide a high-level description of the 
problem for the optimal allocation of service components to 
the proper service nodes of the network, i.e., creation of an 
appropriate deployment pattern. The tool for the Cost 
Effective Deployment of the Service Logic exploits the 
following inputs. First, elements of the service design, to 
obtain inter-component dependencies. Second, the 
estimated service and service feature demand volume. 
These two inputs are the basis for the computation of 
measures related to each component’s popularity (e.g., 
frequency of invocations, volume of information 
exchanged). The third input refers to the service node 
characteristics, for example their capacity and the cost of 
involving them in the solution. Finally, there are 
performance constraints that the service logic should meet. 


B. Formal Problem Description 


A general problem statement may be described as follows: 
given a set of Client Sites (ie CS) that request a service 
through a DPE, two sets of service components 
(u €USRC,s € SSRC ), the respective load they impose on a 


service node (, ,,) (calculated from the rate at which the 
components are being invoked and the computational effort 
that each invocation imposes on the service node), a set of 
service nodes (jeSN) and their capabilities (e.g., 
maximum load that each one may undertake, » , (j)), find 


the optimal placement of components to service nodes that 
results in the minimization of a cost function (associated 
with the cost of involving each service node in the solution, 
maintaining the service component on a service node and 
the communication cost among service components that are 
deployed in different service nodes, etc.), and at the same 
time preserves the set of constraints, which are related to 
the capabilities of the service nodes and some specific 
performance constraints. 


IV. FORMULATION OF THE PROBLEM 


The objective of this section is to evaluate the 
communication costs among the several sets of service 
components (USRC, SSRC) that have been defined in the 
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service logic and will be deployed on ‘different service 
nodes during the application of the Service Logic 
Deployment. In specific, the goal is to find the nodes on 
which USRCs may be deployed (N,, c SN), as well as the 
nodes on which SSRCs may be deployed (N, c SN)- 
Moreover, the allocation of CSs to SNs denoted as A; 
(A; c CS) where A; is the set of CSs whose demand for 
USRCs is satisfied by j « N,, and the allocation of SNs to 
SNs denoted as Ay (A, C N,) where is the set of SNs 
whose demand for SSRCs is satisfied by k « N, must be 
found. A general view of the deployment pattern is depicted 


in Figure IV-1. 
me USES a 


Se) 


Figure IV-1: Service Logic Deployment Pattern 


The proper deployment of the service logic in the target 
environment may be formulated as an optimisation problem 
[17]. To this end, an equivalent linear programming 
problem for determining the optimal deployment of 
components to service nodes may be the following: 


Minimise: 

Csy ° Vv; + 

DL Drelsh dul Dali 6(0) mi 
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The cost function comprises the cost imposed by the 
deployment of the service nodes that are involved in the 
solution, the communication cost among components that 
are deployed in different service nodes, and the cost of 
maintaining the components in the service nodes wherein 
they will be deployed. The set of constraints guarantees that 
the load on each SN is such that its performance will be 
maintained within an acceptable range. 


The cost of maintaining components on a service node is 
assumed to be a function of the load that each SN will have 
to cope with. Therefore, the aggregate load that is 
associated with each CS-i, which may be either invitee or 
invited or both, will be imposed on the SN that will control 
CS-i and corresponds to the invocations of the operations of 
the USRCs (deployed in /) by the TRCs in CS-i. Moreover, 
the load that is associated with each SN (j € N,,) will be 


imposed on the SN-k that will control SN7j and corresponds 
to the invocations of the operations of the SSRCs (deployed 
in k) by the USRCs deployed in J). 


The communication cost between components deployed on 
different SNs depends on the volume of information 
exchanged during the invocation of methods 
(meM,,meM,) provided by the USRCs or the SSRCs as 


well as the frequency of invocations of each one of these 
methods. Furthermore, this cost is based on the number of 
USRCs and SSRCs and the distance between the nodes 
where the different types of components will be deployed. 


V. PROBLEM SOLUTION 


A significant issue, apart from the problem formulation, is 
the selection of the solution method. Specifically, integer 
linear programming belongs to a class of problems that fall 
into the NP-complete [17] category. The usual next step is 
to devise a computationally efficient near-optimum 
algorithm ‘providing good (near-optimal) solutions in 
reasonable time. Major alternatives ate simulated annealing 
[18], genetic algorithms [19], or more simple “greedy” 
techniques [17]. 


In the context of this paper we devise a simulated 
annealing-based algorithm. In specific, the solution of the 
problem is divided in an initial and a final phase. During the 
first phase, a greedy technique is used, where at each stage 
the best solution is selected. Firstly, the client sites in which 
TRC are deployed are sorted with respect to their service 
rate invocation. Starting from the most demanding client 
site, the load originating from the USRC and the SSRC is 
accommodated by the service nodes which yields minimum 
cost in the objective function (cost function). By repeating 
the same procedure for all the Client Sites an Initial 
Assignment is found. 


During the second phase of the solution a simulated 
annealing-based algorithm is applied. The objective is to 
find an improved assignment of service components to 
service nodes. To this end, the new solution that may be 
found is to assign the load originating from each service 
component to one service node which is different from the 
one currently assigned. The Initial~ Assignment is 
considered to be the starting point. The initial temperature 
designates the number of iterations of the algorithm. Then, 
a new solution that is neighbouring to the current one is 
found and the cost difference of the two solutions is 
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computed. If the cost of the new solution is smaller than the 
current one, then the new solution is accepted and becomes 
the current best solution. Otherwise, the new solution is 


accepted based on the probability e %'CT) 5 pand(0,)), 


where Ac is the cost difference and CT the current 
temperature. This approach is followed in order to escape 
from locally optimal points. Neighbouring solutions are 
selected randomly among all the possible neighbours of the 
current best solution. Then, the cooling schedule is applied, 
in order to calculate the new temperature. If the current 
temperature is zero, or if the stop criterion is satisfied, the 
procedure ends and the algorithm ends. Note that there are 
various alternatives for realising the stop criterion. In this 
version, the algorithm stops when no improvement has been 
made after a given number of iterations of the algorithm. 


VI. CONCLUSIONS 


This paper addressed a tool regarding the proper 
deployment of the service logic on the target environment. 
Our approach spanned over the underlying practice, the 
software technologies comprised, libraries of reusable 
components and tools for optimising the QoS levels at 
which the services will be provided. 


Based on the TINA Service Architecture and given a set of 
Service Components, the load they impose on a Service 
Node, the set of Service Nodes and their capabilities, the 
objective was to find an optimal allocation of Service 
Components to Service Nodes by minimising a cost 
function subject to a set of performance constraints. This 
problem was optimally solved by reducing it to a linear 
programming problem. Since the latter is computationally 
complex, a simulated annealing-based algorithm was used. 
The outcome of the applied algorithm is the optimal 
deployment of Service Components on Service Nodes. 
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A Two-bit Scheme for Service Differentiation in 
IP Networks 


Anton Kos, Student Member, IEEE, Sa8o Tomazit, Member, IEEE 


Abstract--This paper presents possible approaches to traffic 
and service differentiation in IP networks and interesting 
simulation results for the proposed two-bit service 
differentiation scheme. Currently there are two major 
approaches to introduction of Quality of Service (QoS) to IP 
networks. Integrated Services (IS) with RSVP rely on explicit 
flow-by-flow reservation of network resources, and 
Differentiated Services (DS) that make use of a relative traffic 
differentiation based on priorities. Our work is focused on 
evaluation of properties of IP networks, which use packet 
priorities for securing different levels of QoS. Our two-bit 
scheme for service differentiation allows up to four priority 
levels. For evaluation purposes we chose four applications with 
different types of traffic and granted them different priorities. 
First we simulated current best-effort scheme and then 
compared it to our two-bit priority scheme under the same set 
of conditions. Simulations have uncovered many interesting 
results and justified the implementation of differentiated 
services into current best-effort IP networks. We conclude that 
introduction of even simple priority schemes into best-effort 
IP network, can significantly improve its performance. 


Index Terms--service differentiation, IP QoS, traffic priorities 


I. INTRODUCTION 


Recently a number of proposals have been made concerning 
the addition of enhanced services to the Internet. To date 
two major approaches to introduction of Quality of Service 
(QoS) to IP networks have emerged: connection oriented 
Integrated Services (IS) with RSVP signaling rely on 
explicit flow-by-flow reservations of network resources, 
and connectionless Differentiated Services (DS) that make 
use of a relative traffic differentiation based on priorities 
that are included in the header of every packet. 

While adoption of IS implies changes in all network nodes 
and devices, DS require changes manly on the boundaries 
of a network. With the later it is also important that it does 
not require changes to user applications that are already 
running today. 

Schemes using a single bit priority filed in IP packets have 
been proposed as a low-cost way to augment the single 
class best effort service model of the current Internet to 
include some kind of service discrimination. Our work is 
focused on evaluation of properties of IP networks, which 
use packet priorities for securing different levels of QoS. In 
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this paper we examine a two-bit scheme for service 
differentiation and we use simulations to evaluate delay and 
throughput characteristics of the proposed scheme. 


Il. SERVICE DIFFERENTIATION 


Lately QoS has been a buzzword in the world of 
(tele)communications and to insure its different levels some 
sort of traffic differentiation must be introduced. There has 
been much debate about introduction of QoS and many 
solutions and schemes have been proposed to upgrade 
current best-effort service of existing IP networks. Service 
differentiation would enable introduction of new services 
that are not possible in today's best-effort networks and 
would also bring new possibilities and benefits for existing 
users and applications. 

The most straightforward approach is labeling packets with 
information on their importance, which is carried in their 
headers. In that way we can control the delay and 
throughput characteristics up to some point. Flows with 
higher importance get better service (lower delay and 
higher throughput) and flows with lower importance get 
worse service (higher delay and lower throughput). 

The simplest way of doing that is to have only one bit for 
QoS information. That means only two kinds of traffic: low 
and high priority. One-bit schemes for service 
differentiation can be found in [2] and [3] and are popular 
for their simplicity. But with ever increasing demands, both 
from the user and network management sides, the question 
arises: can this kind of service differentiation satisfy their 
wishes and demands. 

In this article we propose, simulate and evaluate a two-bit 
scheme for service differentiation in IP networks, which is 
an upgrade from a one-bit scheme presented in [3]. 


Il. A Two-Bit SCHEME FOR SERVICE DIFFERENTIATION 


Introduction of differentiated services with assigning a 
priority level to each packet, which is usually done on 
network boundaries, means that network nodes will have to 
discriminate between packets using more sophisticated 
queuing disciplines. 

In the proposed two-bit scheme we chose delay as the basic 
QoS parameter. We assign priorities to packets depending 
on application demands for delay that can be experienced 
by certain service. 

Four levels of traffic (packet) priorities are possible in our 
scheme that is implemented as four separate queues in 
network elements. Priority 4 traffic is put in the highest 
priority queue and priority | traffic in the lowest priority 
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queue. Queues are then serviced in the "strict priority" 
fashion. Highest priority queue is serviced first and is 
continued to be serviced until there are packets in it. 
Priority 3 queue is serviced next but only if there are no 
packets in priority 4 queue. Consequently, priority | queue 
is serviced only when all the higher priority queues are 
empty. 

In the proposed scheme with four priority levels we assign 
highest priority to delay critical traffic and lowest priority to 
delay tolerable traffic. 


IV. SIMULATIONS 


We have chosen four different applications with different 
delay demands: speech, video, WWW and FIP. The most 
delay critical is transmission of speech and therefore gets 
the highest priority (4), video gets priority 3, browsing the 
web priority 2 and file transfer the lowest priority (1). 

In simulation we have used a simple network topology with 
two routers which is shown in Figure 1. 


Node 2 


Figure 1: Simulation network topology 


For simulation purposes we have used queues with a large 
buffer space so that it can not overflow and packet discard 
does not occur. As already explained we used strict priority 
service discipline with high priority queue being serviced 
first until empty and then moving to the next lower priority 
queue. Inside each queue packets are served on FIFO 
principle. Our discipline is also non-preemptive, which 
means that packet arriving in higher priority queue can not 
disrupt processing of the packet in lower priority queue that 
is being serviced at that moment. Traffic characteristics for 
four applications are listed in Table 1. All values are given 
in bytes and do not include headers of protocol stacks. 


[Property _|_ Speech _| Video _| WWW/FTP 
[Min.datalength | 30 | 640_-+| 10 
Max. datalength | 30 | 640 | 1400 


Packet length , 

phate. - Uniform 

distribution ee ee ee 
Packet arrival Constant time : 

ss ; Exponential 
distribution intervals 


Table 1: Traffic characteristics 


Network nodes were connected with a point-to-point 
connection with a bit rate of 2048 kb/s. Speech traffic 
consisted of flows with a bit rate of 8 kb/s (coded speech) 
and video traffic consisted of flows with a bit rate of 128 
kb/s (coded video). Both were using the unreliable 
RTP/UDP/IP protocol stack. WWW and FTP traffic were 
not divided into flows, as it was necessary for speech and 
video, and were using the reliable TCP/IP protocol stack. 
FTP files were 100 kB long. 

We have varied traffic characteristic in order to achieve a 
total of 50%, 90% and 100% connection load. Under the 
same conditions we have simulated the best-effort (BE) 
scheme (no prioritization) and two-bit priority scheme for 
differentiated services (DS) with four priority levels. In that 
way we got six simulation scenarios labeled BESO, BE90, 
BE100, DS50, DS90 and DS100. For each of the scenarios 
we have simulated different shares of a real-time (RT) 
traffic (speech and video) ranging between 10% and 90% 
with the step of 10%. 

Simulation time was set at 100s and with each simulation 
we have supervised the total connection load, share of each 
traffic type, amount of data transferred, delay characteristics 
and possible packet discards. 


V. SIMULATION RESULTS 


Our main and primary concern were delays of the four 
traffic types which includes only queuing delay in network 
nodes and omits multiplexing and propagation delay on the 
network side and packetizing and possible encoding and 
buffering (jitter compensation) delay on the user's side of 
the connection. From the simulation results we can draw 
out some important conclusions: 

As expected, with introduction of DS the most gains 
priority 4 speech traffic. When average delays for BE and 
DS scenarios in Figure 2 are compared it can be seen that 
the highest gains are at 100% load where delays fall 
drastically. But there is considerable improvement even at 
90% load what can be observed in Table 2 and Table 3. The 
interesting result here is that average delays fall with 
increasing share of RT traffic (speech and video). That can 
be explained with lower share of non-RT traffic, which has 
in average longer packets. Since our scheme is non- 
preemptive speech packets must wait for those packets to 
finish transmission and consequently experience longer 
average delays when there is higher share of long packets. 


[RT share | 10 | 20 | 30 | 40 | 50 | 60 | 70 | 80 | 90 | 
[ps90_ [2.3] 2.2[ 2.1] 2.12.0] 1.9] 1.8117] 1.6 | 


Table 2: Average delay for speech at 90% load 


(RT share | 10 | 20 | 30 | 40] 50 | 60 | 70] 80 | 90] 


Table 3: Maximum delay for speech at 90% load 
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Figure 2: Average speech packet delay (ms) 
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Figure 3: Average video packet delay (ms) 


Very similar but a bit less drastic are results for priority 3 
video traffic that are shown in Figure 3. The results are 
again best visible at 100% connection load where gains are 
considerable. At lower loads gains are less visible but still 
important. 

Priority 2 WWW and priority | FTP traffic are the "losers" 
of our scheme. With introduction of DS they both 
experience higher delays, what can be clearly seen in Figure 
4 and Figure 5. The increase in average packet delay is 
especially high at 100% connection load but can be also 
nicely seen at 90% load. 

While in DS scheme FTP experience higher delay 
independently of the share of RT traffic, WWW DS scheme 
delays overcome BE scheme delays only at 50% share of 
RT traffic. The reason lies in the fact that at low RT traffic 
shares WWW can make use of its priority over FTP traffic. 
Let us now compare average delays for DS100 and BE100 
scenarios for all four priorities that are shown in Figure 6 
and Figure 7. It can be seen that average delays in BE100 
are the same for all traffic types, what was of course 
expected. All packets are equal and are not discriminated in 
any way, so they must have the same delay characteristics. 


90 


Figure 4: Average WWW packet delay (ms) 
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Figure 5: Average FTP packet delay (ms) 
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Figure 6: Average delay for DS100 scenario ( ms) 


In DS100 scheme priorities take their toll on WWW and 
especially on FTP traffic while video and speech improve 
their delay characteristics considerably. If average delays at 
90% of RT traffic are compared then it can be seen that 
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speech has delay 1.7 ms, video 5.5 ms, WWW 8 s and FTP 
11.3 s (Figure 6). Huge differences. 

Another interesting result is shown in Figure 8 where 
throughputs between BE and DS schemes are compared. An 
encouraging result shows that at 90% connection load the 
throughput for FTP traffic has practically not changed, it 
remains at almost 100% of its BE value. At 100% load the 
throughput of FTP is falling quickly with increase in the 
share of RT traffic. 


60 Speech BE100 


80 


Figure 7: Average delay for BE100 scenario (ms) 


FTP DS90/BE90 


60 FTP DS10W/BE100 


70 


90 


Figure 8: Throughput ratio DS/BE at 90% and 100% load 


In all of the above figures a share of RT traffic is depicted 
on x-axis and delay in ms on z-axis. 

Because of the non-preemptive nature of our scheme the 
maximum allowable packet length influences average 
packed delay. In our case the maximum packet length was 
1460 byte. It must also be noted that the increase in RT 
traffic was not continuous but in steps that were defined by 
the bit rates of speech and video flows. 


VI. CONCLUSION 


In the proposed two-bit scheme for service differentiation in 
IP networks queues are served first by priority and inside 
each priority queue the FIFO principle is used. The 
weakness of this scheme is that it does not offer isolation of 


flows within the same traffic type (priority). That means 
that all users that, for instance, browse the net experience 
the same average delay. 

Proposed scheme offers almost predictable service for the 
traffic that falls into the same priority level. If we specify 
delay as the primary criterion for service differentiation 
then the proposed scheme offers good differentiability and 
at the same time does not greatly change the throughput, 
except for the lowest priority traffic at congestion times 
when it is displaced by higher priority traffic. 

From the simulation results it is clearly evident that the 
proposed two-bit scheme offers good service differentiation 
between traffic types in conditions of medium and heavy 
loaded network. In the case of lightly loaded network there 
are no important differences between both schemes. At the 
same time with assuring low delay conditions for delay 
critical real-time applications there are no noticeable 
decrease in throughput for lower priorities (except at 
congestion times). Delays for low priority traffic jump to 
the skies but that does not severely affect FTP performance 
(in our case), which is resistant to delays. 

But average packet delay and throughput are only two of 
the criteria, which can define QoS. There are many others. 
For real-time traffic maximum delay and delay variation 
(jitter) are almost equally important and should not be 
neglected. 

We can conclude that the proposed two-bit service 
differentiation scheme can significantly improve conditions 
for the delay critical traffic without severely affecting 
throughput for other traffic and can help with introduction 
of new and improving of existing services. 
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AIM Interworking Unit with ISDN and Ethernet Interfaces 


Joaquim Neves Member, IEEE 


Abstract-- The convergence of telecommunications and 
computer communication technologies requires the _ 
development of Interworking Units to convert different data 
formats, protocols and control procedures. This paper 
describes the architecture of an electronic device, which is 
able to perform interworking functionalities between ATM, 
ISDN and Ethernet networks and presents several scenarios 
for interconnection environments. 


Index terms-- ATM, ISDN, Ethernet, LANE, Interworking. 


I. INTRODUCTION 


Digital communications environments are generally 
characterised by two main support technologies. By one 
side, on computer oriented communications, Local Area 
Networks (LAN), supported by Ethernet or Token Ring 
specifications, based on data packets, are widely used to 
interconnect locally or remotely computers and terminals. 
By other side, Public Switched Networks (PSN), based on 
data circuits, such as the Integrated Services Digital 
Networks (ISDN), have not only capabilities to 
interconnect a large variety of terminal equipment, but also 
to interconnect remotely several LANs, through switched 
or permanent constant rate links. 


The introduction of the Asynchronous Transfer Mode 
(ATM), which is neither circuit nor packet oriented 
transmission protocol but a conjunction of the best of the 
both techniques, permits the convergence of routing and 
switching technologies and the specification of new 
communication platforms, supported by different interfaces 
and transfer protocols. 


The integration of this emergent technology, within 
telecommunication and computer networks, requests the 
development of special adaptation units with interworking 
functionalities. The compatibility between different types of 
services requires the development of adaptation protocols 
for data presentation, at the application level, namely for 
non-ATM, narrow or broad band services. 
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On the other end, the compatibility between distinct 
systems forces the specification of in common physical and 
logical interfaces and the development of Interworking 
Units (IWU), to adapt different transmission, switch and 
flow control protocols. 


This paper presents the architecture of an electronic device, 
which is able to handle the interworking functionalities 
between ISDN, ATM and Ethernet environments, and 
discusses the integration of such device on different 
network scenarios. On the next section, it is presented the 
architecture of the Interworking Unit while the section tree 
characterizes the main interfaces of the system. Section 
four describes some interconnection scenarios and the 
section five summarizes the main conclusions. 


Il. SYSTEM ARCHITECTURE 


Since the main objective of the ATM Interworking Unit is 
the conversion of ISDN and Ethernet traffic to the ATM 
formats, most conversion procedures have to be located on 
lower layers of the OSI protocol reference model, namely 
the Physical Layer and the Medium Access Control Layer. 
On the upper layers, the information flow generated by 
network applications can be carried out transparently, 
through the IWU, over different transmission or network 
protocols such as IP, IPX, NetBEUI, AppleTalk or 
DECNet. 


The figure 1 illustrates the interconnection of Ethernet, 
ISDN and ATM terminals through the Interworking Unit. 
The hardware architecture of the proposed IWU is based on 
a central processor, which manages the data transfer 
between different line interface modules and the memory, 
as is sketched on the bloc diagram of the figure 2. 


Ethernet 


Interworking 
Unit 


Figure 1: - Interconnection of ISDN ATM and Ethernet 
Terminals through the Interworking Unit. 
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While each line interface module performs the line 
termination and the associated transfer protocols for each 
network environment, the processor performs the protocol 
conversion between different environments. 


The proposed Interworking device has two data buses: the 
Utopia Bus [1], which sustains the cell transfer between 
line interface modules; and the processor bus, which allows 
that ATM cells, ISDN channels and Ethernet packets, 
maybe transferred between line interfaces, the processor, 
and the memory. 


The processor unit, based on the Motorola PowerQuicc [2] 
architecture and the Tornado operation system, is able to 
support real time applications. 


Processor 
Unit: 


Ethernet 
IF EF 


Figure 2: - Hardware Architecture of de Interworking 
Unit. 


The adaptation of data formats and protocols are 
implemented at the ATM Adaptation Layer (AAL) [3], by 
mapping the Ethernet packets or ISDN stream into ATM 
cells, which are transferred on ATM Virtual Path or 
Virtual Channel Connections (VPC/VCC), over switched 
or semi-permanent circuits, according to ITU specifications 


[4]. 


This processor unit also has capacity, not only to dial with 
different data formats and protocols of the several network 
environments, but also to support ATM and ISDN 
signaling, together with LAN Emulation (LANE) [5] and 
Multiprotocol Over ATM (MPOA) [6] procedures. 


Il]. PHYSICAL AND LOGICAL INTERFACES 


Since the IWU have to convert different network 
environments, the proposed electronic device has to 
support the appropriated physical and logical interfaces to 
interconnect several types of terminals. 


A. ISDN Interfaces 


The Interworking Unit has two ISDN interfaces for the 
basic and the primary access, respectively. 


1) Basic Access 


The Basic Access Interface is specified at the reference 
point S, by several ITU recommendations [7]. 


With a transmission rate of 144 Kbit/s, S bus is able to 
interconnected at least 8 terminals, by sharing one or two 
64 Kbit/s time slots for data information transfer (B 
channel), while the signaling information is transported 
out of band on a 16 kbit/s D Channel. 


The Basic Access Interface can be used to interconnect 
ISDN terminals to Local Area Networks (LANs) over a 
Public Switched Network (PSN), trough one or more of the 
other interfaces of the IWU (i.e. the Primary ISDN access, 
the ATM interface or the Ethernet). 


2) Primary Access 


Physical and logical characteristics of ISDN primary access 
interface are specified by ITU [8]. 


While the basic access interface is able to interconnect 
ISDN terminals trough different LANs over the PSN, the 
primary access interface is appropriate to support the 
interconnection of different network environment, such as 
ATM or Ethernet. 


B. Ethernet Interface 


Since most of computer terminals communicate trough an 
Ethernet Interface, according to IEEE 802.3 standards, 
namely 10 Base T and 100 Base T specifications, the 
proposed IWU have to handle with these interfaces at 
physical and logical levels, to convert the Ethernet 
environment to ISDN or ATM specifications. 


C. ATM Interfaces 


The Interworking Unit has two ATM physical interfaces: 
one operates at 2 Mbit/s and the other operates at 25 
Mbit/s. 


1) ITU-T G804 
On the 2 MBit/s ATM interface, the cells are 


according ITU-T recommendation G. 804 [9] on the frame 
described by G. 703 [8]. This interface is appropriate to the 


———_ SS 
154 10” Mediterranean Electrotechnical Conference, MEleCon 2000, Vol. I 


interconnection of ISDN and ATM environments over the 
PSN, or to interconnect LANs trough ATM networks. 


2) ATM Forum 25.6 Mbit/s 


The ATM Forum 25,6 Mbit/s over twisted pair interface 
[10] is suitable to support the interconnection of ATM 
terminals to the ATM or LANE Networks. 


IV. INTERCONNECTION CENARIOS 


Most of standardised ISDN primary access interfaces, such 
as Wired, Optical or RF Links, as well as, Add and Drop 
Wired Rings, operating at 2 Mbit/s, are able to 
interconnect remotely the Interworking device to one PSN 
Local Exchange or to another IWU. Other standardised 
interfaces, such as the ISDN basic access, the 25,6 Mbit/s 
and the Ethernet are more suitable for terminal 
interconnections, within ISDN, ATM or Ethernet 
environments. 


The figure 3 sketches the interconnection of the IWU with 
ISDN, ATM or Ethernet networks, to perform remotely the 
interworking procedures between different type of 
terminals. 


Interworking 
Unit 


Figure 3: - Interworking between ISDN, LAN ant ATM 
Networks. 


The IWU have to dial to two main interworking scenarios: 
ATM/ISDN and ATM/Ethernet. In both cases, 


Management, as well as Operation and Maintenance 
(OAM) procedures, carry out by appropriated protocols 
[11], are also supported by the IWU. 


A. ATM/ISDN Interworking 


The bloc diagram of figure 4 presents the logical 
architecture of the [WU for the ISDN and ATM services 
conversion. For this propose, the [WU has to convert data 
formats between ATM and ISDN interfaces, as well as the 
signaling and management protocols, according the ITU 
specifications [12]. 


The information flow of ISDN services supported by High- 
Level Dada Link Control (HDLC) protocols, such as the 
Link Access Procedure Balanced (LAPB), the Link Access 
Procedure on D Channel (LAPD) and the Link Access 
Procedure for Frame-Mode Bearer Services (LAPF), are 
converting to ATM formats, trough the ATM Adaptation 
Layer (AAL) type 1 or type 3/4 [3], and are supported by 
ATM Layer Virtual Path or Virtual Channel Connections 
(VPC/VCC) [4]. The signaling information related to 
ISDN and ATM services is also converted on the IWU, 
according to ITU-T Q.931 and ATM Forum UNI 


specifications [13]. 
Higher Higher 
Layers Layers 
X.25 
0.931 


stele 
ATM LAPB-LAPD 


Frame Relay 


Physical 
Layer 


Physical 
Layer 


Physical 
Layer 


Figure 4: - Logical Architecture of the Interworking Unit 
for ISDN services. 


B. ATM/Ethernet Interworking 


The proposed IWU supports Routing, Bridge and Gateway 
functions on ATM/Ethernet environments. The conversion 
of ATM and Ethernet traffic formats and protocols, is 
made according the LAN Emulation specifications [5]. 


The IWU implements the LAN Emulation Sever (LES), 
which supports the ATM Address Resolution Protocol 
(ATMARP), the LAN Emulation User-Network Interface 
(LUNI) and the Interim Local Management Interface 
(ILMI). Figure 5 presents the IWU logical architecture for 
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LAN Emulation over ATM, according the AT™ Forum 
specifications. 


The IWU also support other LAN Emulation 
functionalities, such as Broadcast and Unknown Sever 
(BUS) and LAN Emulation Configuration Sever (LECS), 
to permit that ATM terminals, supporting the LAN 
Emulation Client (LEC) functions, may communicate with 
other Ethernet terminals, without the intervention of 
external equipment. 


Figure 5: - Logical Architecture of the Interworking Unit 
for LAN Emulation. 


Other standardized protocols, such as the Classical IP over 
ATM (IPOA) and the Multi-Protocol Over ATM (MPOA) 

[6] could also be implemented in the future, together with 

the Next Hop Resolution Protocol (NHRP) 

and the Next Hop Resolution Protocol Sever (NHS). 


Broadcast and multicast capabilities could also be 
supported with the implementation of Multicast Sever 
(MCS) and Multicast Address Resolution Sever (MARS) 
functionalities. 


C. Management Procedures 


The Interworking Unit carries out as well management 
procedures, supported by SNMP, CNM [14] or TMN 
protocols. In addition, Operation and Maintenance (OAM) 
procedures for ATM environment [11] are used to control, 
in a very flexibly way, some hardware capabilities, namely 
the remote configuration to execute all the possible ; 
combination of the interworking functions between ATM, 


VY. CONCLUSIONS 


In our days, the Asynchronous Transfer Mode (ATM) 
establishes, more than, the basis structure of the Broadband 
Integrated Services Digital Network (B-ISDN), a shared 
platform for the transmission and the switching over most 
of circuit or packet oriented networks. 


The Interworking Unit, presented on this paper, has a 
flexible architecture, based on a central processor unit, 
which control the information flux between the modular 
line interface broads through two independent data buses, 
and is able to processes, in real time, the protocol 
conversion between the involved data formats. This 
electronic device can be integrated on a wide verity of 
communication infrastructures and traffic environments, 
for interconnection of ATM, ISDN or LAN terminals, 
locally or remotely, over a public telecommunication 
network. 
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Modeling of Cell Delineation Procedure in ATM 
Network 


Marina Kopeetsky, Yury Zlotnikov 


Abstract—The purpose of this paper is research the 
synchronization properties of a Cell Delineation (CD) process 
in ATM networks. We demonstrate the CD process 
probabilistic model as a sequence of discrete states. 
Cooperation of these states is specified by a semi-Markovian 
process. One of the basic components of this process, that 
defines possibility of false per-cell synchronization, is the 
HUNT state. To analyze basic characteristics of the HUNT 
state, we create a Markov chain oriented model. This model 
takes into consideration the properties of the cell header error 
control (HEC) redundant cyclic code. Besides, we take into 
account two channel error models which are relevant to 
fiber-optic channels. The first model is the Bernoulli 
memoryless one. The second model corresponds to the two 
state simple Markov chain. We present a computer algorithm 
to evaluate analytically the probability of false synchronization. 
The numerical results are demonstrated and discussed. 

Index Terms— ATM networks, cell delineation process, cyclic 
codes, synchronization in networking. 


1. INTRODUCTION 


The Cell Delineation (CD) process is of a special 
interest in high speed ATM networks. This is because many 
different problems concerning execution and utilization of 
ATM networks are caused by cell synchronization failures. 
These failures are escalated to the higher layer protocols. 
The corresponding recovery procedures that are applied 
require a considerable expenditure of network resources. A 
concise determination of the CD characteristics is a serious 
problem nowadays since it is related to support of Quality of 
Service (QoS) characteristics. 

The CD process is structured as follows. Three 
discrete states are introduced which are classified as 
SYNCH, HUNT and PRESYNCH (Figure 1) [1,2]. The 
SYNCH state corresponds to a normal per cell 
synchronization at the receiver. It is characterized by a valid 
HEC. If a consecutive cells have been received with 
invalid HEC (per cell operations), the system goes to the 
HUNT state. The cell’s bits are shifted in the shift register 
one at a time in order to look for a position with a valid HEC 
(per bit operations). As soon as this is found, we follow to 
the PRESYNCH state, meaning that the cell’s correct 
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boundary has been tentatively located. If 5 consecutive cells 
have been received with this feature, the receiver returns to 
SYNCH state to start a normal operation. Otherwise, the 
receiver returns to HUNT state. A specific feature of the 
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correct HECs 
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failures 


Source of 
channel 
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Fig. 1. CD process structure 


HUNT state is the possibility of cell false synchronization. 
This happens when the HEC code generating polynomial 
G(x) = x°+x*+x+1 is not able to detect errors in the cell 
header when a cell’s non-synchronous state is being 
analyzed. As a result, an undetected error has taken place 
and, by this, a cell is received in a non-valid state. 
Definitely, probability of false synchronization depends 
upon properties of the polynomial G(x). False 
synchronization leads to incorrect treatment of cell 
information field by the upper layer protocols. Analysis of 
cell false synchronization probability is an important 
problem referring to error flow realistic models in the 
fiber-optic communication channels. We analyze two error 
flow models: the Bernoulli memoryless model and the 
two-state simple Markov chain memory model. 

Some problems related to structuring and implemen- 
tation of the CD process have been discussed previously. 
The HEC code generating polynomial G(x) and @ and 
values have been chosen by computer simulation in [3,4]. 
The decoding algorithm realization has been proposed based 
on implementation of coding/decoding operations of cyclic 
codes by means of the linear feedback registers [3,5]. 

A special model for error decorrelation in the header 
decoding process referring to CD has been proposed in [6]. 
However, up to now, no analytical research been performed 
in mathematical modeling of Hunt state and, by this, in 
quantitative evaluation of false synchronization probability. 
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2. MARKOV MODEL OF HUNT STATE 


Now we turn to creation of the Markov model of 
HUNT state and, by using methodology of Markov chains, 
to determination of mathematical expectation and 
variance of sojourn time in HUNT state. We intend also to 
obtain on upper bound of false synchronization probability. 
The probabilistic model includes a set {C} of 427 discrete 
states. These states are as follows. The initial shift from 
the correct synchronization boundary is a random variable. 
To take into account this feature, we have to introduce the 


initial state Cg. Really, this state is phantom. We assume 


that the components of the initial probabilistic vector (0) 
are distributed geometrically and shifts of few bits are the 
most probable. The state C, corresponds to cell correct 
synchronization. The state C, corresponds to cell synchro- 
nization with undetected error (false synchronization). 
These two states are essentially absorptive ones. Each state 
C, (=3...426) corresponds to a (i- 2) bit shift from the 
left correct boundary of receiver’s clock. By this, state 
C426 corresponds to the 424 bit shift from the left correct 
boundary. This means the false synchronization event. 

As a matter of fact, we deal with the specific case of 
the semi-Markovian process, i. e., a Markov chain. The 
reason is that the time of transition from a state to a state is 
constant and equals to t, +t), where ¢, is cell transmission 
time and ¢, is HEC processing time in the receiver. Essen- 
tially, HEC processing time is random, but we neglect time 
variation in order to simplify our model. Assume that 
ty = 3t,.The probability transition matrix G is structured 
as 
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Now, based on mathematical expectation and variance of 
sojourn time z, in HUNT state, we can determine the 
distribution function of z,. Assume that 7, follows to 
Gamma-distribution. 

Up to now, there is no sufficient statistics on error 
models in fiber-optic channels. Therefore, it is reasonable to 
analyze the two following realistic models: the Bernoulli 


memoryless model and the two-state simple Markov chain 
memory model. 


3. COMPUTER ALGORITHM 


Let us define the upper boundary of the undetected 
error probability. Assume, at first, that channel errors are 
independent. Suppose that Q is a false synchronization 
probability. The following notations are introduced: 

{A} is a set of code combinations, Ap is a zero code 
combination, 4,(i = 1,...,2°2 — 1) is the i-th code combination, 
{B} is a set of Standard Array elements, B; is a Standard 
Array element and S; is a j-th coset leader (Figure 2); go is 


a probability of undetected transition from any code 
combination 4, to another one 4j; M is a number of 


Standard Array cosets, N is a number of code 
combinations; q, is a probability of transition from any code 
combination A; to non-correct combination 8; in Standard 
Array; d is the Hamming distance of the code (40, 32) 


appropriate to the HEC control relations; 
B, is anumber of code combinations of weight v and a, is 


a number of cosets leaders of the weight v; P is a correct 
synchronization probability; y,, is a number of the Standard 


Array elements B, of the weight v that differ from all 4, 
and Sj; p, is probability of transition of any code 
combination A, to any Standard Array element that is 
placed in the same row with 4, and pp is probability of 
correct receipt of code combination; ¢ is a bit error 
probability in Bernoulli model and _, is a final bit error 
probability in the Markov error model; y is a maximal 
number of corrected errors in the HEC code combination. 


Fig. 2. Standard Array 


As a result, probability of false synchronization is 


es Got 1 
9fo+%1 + Pot Pi 


Based on the notations, we determine probability of detected 
but not corrected error as 

P=1—po- Py-9o- 41: 
Suppose that we deal with the binary symmetric 
communication channel without erasures. Then we can 
produce the following equations: 


Po =(I-e)" (1) 
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Taking into account that a = 1; fy = 0;79 = 0 we simplify 
the formula (2): 
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Here a, =a, +,+7, isa total number of the Standard 


Array elements of the weight v and £, = 8, +7, isa total 
number of the Standard Array elements of weight v that 
differ from the elements of the first row that contains the 
code combination Ap. 

Now we follow to determining of the Q upper boundary. 
It is evident that 
p= §{"lra-s 3) 


where p. is a maximal number of detected errors. 
Consequently, from (1) and (3) we conclude 


iad n v n-v 
PotP= 2) je d-e) - 
v=0\ V 


In our group code, weight of any code combination is at 
least d. Therefore, we produce estimation 


do S(N -Ne“(1- 2)". 

Weight of any element of a coset (except coset leader) in 

Standard Array (Figure 2) is not less than d-v, where V 
is the weight of the corresponding coset leader. There are 


@ components of the weight V (v =1,2,...,4) in each 
v 


coset. Consequently, 


q <(N 1) ("a <2 ate 
v= V 


As aresult, the probability of false synchronization satisfies 
the condition 
O=1-P< 


(N a n§{"}e" i- eye” 
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Taking into account that in our case  =1,n=40,N = ba: 
and d=3 [7], the formula (15) is presented as 


(3 -1}? (-e)38 
O< 
(1-2)4° + e(1—2)39 +(232 -1[e3 1-2)” +2 (1-2) 38) 

Direct determination of cell false synchronization 
probability in the case of Markov channel model is 
essentially complicated. The reason is that the correlation 
between the erroneous bits should be taken into account. As 
a result, the evaluations of probability transition 


Ag ——> {A} and Ap —>{B, \ (i #0) causes a lot of 


calculations in the case of Standard Array dimension by 


x41. Thus, we propose an algorithm that enables to 
determine the Q upper boundary. 


The probability of transition from 4) to {4} or from 4) to 
{B} is E < {6P(d +1, n)+(1-0)P(d-1, n)} 

We assume that the distance d(4y,4,) is equal to d, thus the 
inequality is reasonable. Also, we take into consideration 


that the set of cosets, except the zero coset, includes in our 
case only the combinations of the weight 1. Under this 


assumption, any transition 4) —— B; (i #0) leads to an 
increase of weight of B; combination to the value d+1 or 


follows to decrease to the value d-1. The coefficients @ 
and 1-8, 0<6 <1, define the relative amount of these 
transitions. 


4. NUMERICAL RESULTS 


Figure 3 demonstrates dependence an upper 
boundary of the probability O as a function of an argument 
¥2 referring to the various values of the parameter 0. The 


graphs show that in the range [0,1], variations of Q are 
insignificant in the case of our code. Case | corresponds to 
@=1, case 2 corresponds to 6 =0.5 and case 3 corresponds 
to 0=1. 

Figure 4 demonstrates behavior of an upper boundary of the 
probability O as a function of an argument ¢. Two cases are 
considered. Case | corresponds to implementing the code 
(40,32) in the single error correction mode. Case 2 
corresponds to the error detection mode only. 


5. CONCLUSIONS 


We presented a general mathematical model of a CD 
procedure in an ATM network. One of the basic states of 
this model is the HUNT state that is related to probability of 
cell false synchronization. We analyzed this probability as 
a function of channel quality referring to two possible error 
flow models. Specifically, we took into account the HEC 
code properties. Based on numerical results, we evaluated 
the range of Q values in the real conditions. 

It is evident that Q increases when € or 7 


increases. The dependence of probability Q on bit error 
probability is substantially stronger in the case of Bernoulli 
model. The reason is that the distribution P(i, n) of i 
erroneous bits in the data unit of n bits is more monotonous 
in the case of Markov model compared the Bernoulli model. 
It follows from the figures that an implementation of 
error correction procedure in the cell header significantly 
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increases the probability Q. Creating the universal platform 
to compare Q for the different error flow models is 
problematic because the errors due to structuring of upper 
border Q in both cases are essentially different. Therefore, 
we produced and analyzed the special borders for quanti- 
tative evaluation of false synchronization probability. 
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Simulation approach for functional verification 
of an “ATM over ADSL” integrated circuit 


E.Zervanos', V. Dimitropoulos”, A.Manolitzas’, and G.Stassinopoulos’ 


This paper concerns the design and development of an 
application specific integrated circuit, targeting advanced 
ATM over ADSL applications. The work focalizes on the 
functional verification of this processor and presents the 
hardware and the basic software architecture of the entire 
integrated circuit. 

Index Terms--ATM, ADSL, Access Networks, functional 
verification 


I. INTRODUCTION 


The forthcoming B-ISDN and its _ packet-oriented 
transmission technology in Asynchronous Transfer Mode 
(ATM), as a widely accepted telecommunication system 
has many promising new telecommunication services, such 
as multimedia and high-speed data transfer. Those services 
demand higher bandwidth connectivity that can be 
transferred through fiber optic or coaxial cables. Plain Old 
Telephone System copper cables, offer a low cost access 
media both in telecom and datacom areas but the existing 
modems can not accommodate the forthcoming high speed 
multimedia services. The principal aim of Asynchronous 
Digital Subscriber Line (ADSL) is to exploit the presence 
of these copper cables by using advanced coding and 
modulation techniques in order to maximize the throughput, 
providing a suitable choice for higher speeds in the access 
network. [1] [2] The direction of the data path from the 
ADSL Transceiver Unit — Central office (ATU-C) to the 
ADSL Transceiver Unit — Remote terminal end (ATU-R) is 
denoted as downstream, while the opposite direction as 
upstream. The downstream and upstream rates of an ADSL 
system depend on the length and the noise characteristics of 
the local loop, having maximum respective values of 
8Mbit/sec and 640Kbit/sec. Efficient initialization 
algorithms analyzing the characteristics of the cooper line 
determine the values of the above rates. 
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The ATU-C transmitter reference model for ATM transport, 
shown in Figure 1, illustrates the functional blocks required 
to provide ADSL service. 
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Figure 1: ATU-C Transmitter Reference model 


The functionality of the “ATM over ADSL” integrated 
circuit (IC) extends from the V-C reference point to B 
reference point of the model and includes the interleaving 
functionality for the corresponding data path. For the above 
functions the ATU-C and ATU-R reference models are 
similar, the only difference being the rate that has to be 
supported; thus, the same integrated circuit can be used for 
both. [3] 

The purpose of this paper is to examine and test, by 
simulation, the functions that take place within the “ATM 
over ADSL” integrated circuit. These functions are 
separated in software and hardware parts, the basic 
architecture of which is proposed herewith. 


IJ. INTEGRATED CIRCUIT FUNCTIONALITY AND 
ARCHITECTURE 


The internal architecture of the integrated circuit is depicted 
in Figure 2. 
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The IC is a memory mapped peripheral device composed of 
dedicated high performance programmable Ingress and 
Egress data processing paths. It incorporates one embedded 
core per path and has four interconnect interfaces. The 
Modem i/f is a 20Mhz serial data interface, which 
interconnects the IC with the modem circuitry. This is an 
external circuitry that realizes all the ADSL functionalities 
from B reference point to analog front end and 
communicates with the IC using a per data frame 
handshake mechanism. The UTOPIA level 2 interface is 
used to interconnect the ATM Transmission Convergence 
(ATM-TC) sub-layer with an external ATM layer device. 
From the multi-physical capability of this standard interface 
only two are used, one for the fast and the other for the 
interleaved data path. The MP i/f, is a generic little-endian 
asynchronous interface that interconnects the IC with an 
external on board microprocessor. This microprocessor is 
responsible for initializing the ADSL rate and Reed 
Solomon coding parameters of the chip by writing to 
appropriate registers and processing the operational and 
maintenance flows of the ADSL protocol. This information 
is carried on the Embedded Operation Channel (EOC) on 
the ADSL Overhead Control Cannel (AOC) and on the 
Indicator Bits (IB) of the ADSL super-frame structure. 
Finally, the serial EPROM i/f, is a serial interface with an 
external serial EPROM and its responsibility is to download 
the software program for the two embedded cores, during 
power on procedure. 

The discrimination of the functionalities to software and 
hardware parts is depicted in Figure 3. All the ADSL 
related functions are developed in software using the two 
embedded cores, due to the high variable nature of the 
parameters of the ADSL protocol. The ATM TC and the 
interfaces related operations are implemented in VHDL and 
are synthesized as hardware blocks inside the chip. The 
only ATM functionality that is implemented in software is 
the idle cell insertion, as the encapsulation of the ATM cells 
to the variable rate ADSL data frame is a responsibility of 
the egress core. 


III. FUNCTIONAL VERIFICATION APPROACH 
The advantages of controllability and observability of a 
functional verification, allows a quicker, easier and more 
cost effective error tracking by using a pre-manufacturing 


simulation than by afterwards hardware debugging. The 
main concept of a high level simulation is to develop 
models to produce test vectors for the data path. Those 
vectors help on the veryfication of the functionality of the 
distinct modules, which are implemented either in software 
or in hardware. [4] [5] 
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Figure 3: Hardware — Software Separation 


In order to model a data path environment, we have to 
determine some traffic scenarios for the ADSL fast and 
interleave paths. Our test vectors are based on the traffic 
scenarios that are shown bellow: 

e Only fast traffic 

e Only interleave traffic 

e 50% fast and 50% interleaved traffic 

The parameters of the Reed Solomon coding and the depth 
of the interleave memory can have any valid value from the 
T1.413 standard. The models are created in Matlab, as this 
environment provides more efficient matrix calculations 
than other general purpose languages such as C. Each 
model corresponds to each distinct functional block of the 
IC as depicted in Figure 3. This verification approach is 
based on the fact that every functional block inside the IC is 
independent from each other, and only operates as a “raw” 
data processor between its input and its output. As far as 
this is maintained in the real operation of the system, the 
functional verification approaches the actual simulated 
operation. Since the main characteristic of the “ATM over 
ADSL” IC internal architecture is the isolation of the 
operational blocks, this simulation approach can be useful 
for the development of the system. 

During a typical verification procedure, the Matlab model is 
used along with an input test vector file to produce a test 
vector output file. Then, the actual under test function takes 
the place of the model with the same test vector input file 
and a new test vector output file is produced. The 
verification can be automated by comparing the two 
resulting output files and by producing a report file for the 
differences that may occur. A typical functional verification 
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of an implemented hardware block using VHDL is depicted 
in Figure 4. 


Produced from 
VHDL modules 


Produced from 
Matlab modules 


Figure 4: VHDL Functional Verification 


The test-bench is composed by the synthesizable VHDL 
function, the input and the output test vector files and two 
behavioral VHDL modules. These are used to assert the 
port signals of the module under test with the timing 
characteristics of its input and output interfaces and to feed 
the test-bench with data from the files produced by using 
the Matlab models. 

An overall simulation scenario for the ingress side of the IC 
is depicted in Figure 5. The test vectors, which are 
produced by using the MATLAB models, are used to 
support the software development. The binary file resulting 
from the compilation of the code, is inserted in the top 
VHDL testbench and the whole receive path is simulated. 

In order to simulate the noise characteristics of the local 
loop in the ingress path, the input ADSL data frame vectors 
heading to Modem i/f first pass through a Matlab model 
that inserts periodic bursts of error bits, using a 
programmable parametric algorithm. During the functional 
verification process, several report files are updated in order 
to provide information on the calculation of CRC or Reed 
Solomon errors. 


interleave 


Test 
Vectors § 


Figure 5: Top Level Simulation 


The model for the MP i/f consists of two major parts. The 
first comprises a behavioral model for the timing and 
operational characteristics of the interface, while the second 
is a behavioral pseudocode interpreter that executes small 
interrupt service routine pseudocodes and initiates bus 
transactions by recognizing read and write instructions. The 
above model cannot be developed in Matlab because it is 
not related only with data processing but develops 


interactions with other modules in the architecture. 
Therefore, it is developed in VHDL and it can only be used 
in the top simulation of the whole IC. [4] 


IV. CONCLUSION 


The key point of the functional verification is modeling a 
system at multiple levels of abstraction, in order to hide the 
internal structure of the system. During this approach we . 
are focusing on the behavior and the functionality of the 
distinct operational modules of the IC, at the same time 
ignoring any timing information at the top level of 
abstraction. This simplifies the overall simulation strategy 
but reduces the accuracy of the model. The real time 
Operational characteristics that affect the data streams can 
not be modeled using the above file-oriented processed data 
models. For the above reasons, the simulations of the idle 
cell insertion function and all the microprocessor related 
Operations, such as Dynamic Rate Adaptation (DRA) and 
Dynamic Rate Repartition (DRR), can not be based on this 
functional verification approach. 
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Experimental Study of Correlated Traffic in 
ATM Backbone Network 


Yury Zlotnikov, Emily Rozenshine 


Abstract-- The results of traffic measurements in the 155 
Mbit/s ATM backbone network are presented and analyzed. 
The traffic characteristics measured X(i) is a random number 
of ATM cells received by port-destination per a given time 
interval. It is shown that X(t) can be described by a non 
stationary random process. The cell traffic’s self-similarity 
feature is demonstrated. 


Index terms-- High speed networks, ATM technology, traffic 
statistics 


1. INTRODUCTION 


The ATM (Asynchronous Transfer Mode) 
technology is considered now as an advanced and universal 
platform for high speed network architecture [1]. The 
comprehension of ATM traffic characteristics is of 
particular importance in design and research of high speed 
network protocols [1]. Investigation of cell flows in ATM 
networks is an actual problem. Appropriate statistics are a 
basis for developing the probabilistic models of ATM 
switching nodes and end-to-end connections. The 
traditional data flow models of the Bernoulli or Poisson 
type appeared to be not realistic in ATM networks. New 
more realistic models can be structured on the basis of tests 
performed on the working network. A number of such 
models have been developed recently for a mesh-oriented 
topology of ATM network [2] that are based on the concepts 
of Markov modulated Bemoulli process and Markov 
modulated Poisson process. A paper [3] has shown that 
there exists more long-term correlations in data and video 
traffic. However, up to now there is a lack of experimental 
cell flow statistics referring to the backbone ring topology 
of the ATM network. The ATM network of this type 
incorporates, for example, a number of Ethernet-type Local 
Area Networks (LANs) which are allocated in different 
buildings. The problem is to obtain and to process cell flow 
real-time statistics. To solve this problem, we have studied 
cell flows in a real ATM backbone network and have 
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analyzed the traffic characteristics on a basis of random 
process theory. 


2. ORGANIZATION OF EXPERIMENTS 


The ATM backbone network topology and cell 
flows are shown in Fig. 1. Besides exchange of internal 
traffic between LANs, connections are designed from LAN 
hosts to external networks. The backbone ATM network 
includes four first-layer ATM switches, a second layer 
ATM switch and a set of 155 Mbit/s channels using OC-3 
SONET/SDH interfaces. Seven different network ports 
were examined during the experiment. 


Fig. 1. Network topology 


The experimental data referring to each port were 
produced by a number of stages detached by a specific 
interval A. The experienced cell flow characteristic is 
intensity, i.e., the number of cells received during 0,5 
second interval. Each stage is treated as an appropriate 
outcome. Due to significant delay A, the outcomes are 
independent ones. An abscissa defines a number of standard 
0,5 second intervals assumed to be ten different outcomes 
were produced in each experiment that corresponds to any 
specific value A. A typical set of outcomes are presented in 


TAA ig nnn 
164 1 


0” Mediterranean Electrotechnical Conference, MEleCon 2000, Vol. I 


0-7803-6290-X/00/$10.00 ©2000 IEEE 


Fig. 2 — Fig. 5 considering the different values of time delay 
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Fig. 2. The process’s outcomes, A = 15 minutes 
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Fig. 3. The process’s outcomes, A = 15 minutes 
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Fig. 4. The process’s outcomes, A = 24 hours 


Fig. 5. The process’s outcomes, A = 1 hour 


3. DESCRIPTION OF CELL FLOW BY STOCHASTIC PROCESS 


To describe each port’s cell flow by stochastic 
process, we introduce a set X(t, €) of functions, where f¢ is a 
current time and é is defined as an outcome related to a 
specific stage of experiment. This set is qualified as a 
stochastic process. A random variable X under 
consideration is the intensity of cell flow. The typical 
measured characteristics of the stochastic process are 
presented in Fig. 6 and Fig. 7. The first of them refers to a 
non-random function m,(¢) , where any value t corresponds 


to the mean of the process at the appropriate section. The 
second characteristic is the process’s variance D,(t) as a 


function of t. 
In all experiments performed with the different 
ports, the correlation function is 


Ky (tt) = Mx(e )X(b J) 
where M is the symbol of averaging and the centralized 
random variable is 


X(t, )= X(G)- mt) 
which depends on not only the value of interval ¢, —f,, 


but also on the initial moment ¢,. By this, the cell flow 


under consideration can not be treated as a stationary 
random process. This feature also follows from Fig. 6 and 
Fig. 7 where m,(t) and D,(t) are not the constant 


functions in the observed time interval. 
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Fig. 6. m,(f) function, A = 15 minutes 
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Fig. 8. Typical correlation function, A = 1 hour 


A typical correlation function referring to the 
specific initial section is presented in Fig. 8. 

During the experiments, some distinguished ports 
demonstrated “non typical” behavior in specific time 
intervals in the case of high information load. This feature 
manifested itself as a considerable deviation of the 
m,(t)and D,(t) functions. F.e., Fig. 9 and 10 show these 


functions for the port 3.4 considering A= 15 minutes. 
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Fig. 9. m,,(f) function, “non typical” behavior 
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Fig. 10. D,.(¢) function, “non typical” behavior 


Definitely, the correlation function in these 
experiments is more irregular in comparison with the 
correlation function of Fig. 8. 

The experiments have demonstrated the cell flow 
self-similarity. This property means that the traffic shows a 
long-range dependence regarding cell flow intensity as a 
function of time. According to [4], the stochastic process 
can be treated as a self-similar if its autocorrelation function 


a(t) = M[X()X(t+ Ad] 


can be expressed by an exponent L(t)t-* with the Hurst 


parameter H =1- 8/2, if 1/2 <H <1. Tab. | presents the 


Hurst parameter typical values referring to the different 
ports and different delays A. In all cases, the real 
autocorrelation function was approximated by an 
exponential function. 
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Tab. 1. Calculation of the Hurst parameter values 


It is obvious that all stochastic processes examined 
show characteristics that are consistent with self-similarity. 
The higher value of H, the higher is the level of 
autocorrelation and, consequently, the worse is ATM switch 
performance regarding cell loss probability and cell delay 
variation. 

Although the process X(t, €) has been defined as 
non stationary in a full range of testing time, some specific 
intervals in the test program can be distinguished at a 
number of ports. In any interval AT of this kind, we treat 
X(t, & as a quasi stationary process. This means that during 
each AT, the following inequalities are supported: 

m,(t)min 


In our experiments, 6, was defined as 0,6 and 6, was 


defined as 0,2. 
By this, the maximal relative deviation in m,.(t) 


and D,(t) for all teAT values is limited by the 


corresponding boundaries. For example, such quasi- 
stationary process was distinguished at port 3.3 in the 
interval AT=/1;25] seconds regarding the experiments with 
A = 15 minutes. An appropriate normalized correlation 
function in this interval is shown in Figure 9. This function 
is structured under the assumption that the correlation 
function in the interval AT does not depend on the initial 


moment f,. 


Fig. 9. The normalized correlation function of the process X(t, 4 in the 
interval [1;25] 


A general methodology to analyze an influence of 
correlation in ATM cell flow on ATM multiplexer output 
characteristics is presented in [5]. 


The basic experiments and analytical evaluations 
were carried out in the framework of a Master degree 
research [6] performed at Bar-Ilan University, Israel. 


4. CONCLUSIONS 


We have studied cell flows in the 155 Mbit/s ATM 
network and analyzed their characteristics on a stochastic 
process platform. The non stationarity, specifically the 
self-similarity feature of the traffic demonstrates a necessity 
to develop the nontraditional cell flow analytical models. 
These models should be relevant in design of ATM 
switching architectures, buffer dimensioning in switches 
and optimal network resources allocation. One reasonable 
way to structure these models is to apply a Pareto 
distribution [7]. 
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A Techno-Economic Study Of Broadband Access 
Network Implementation Models 


Milan Jankovic, Zoran Petrovic Member, IEEE, and Miroslav Dukic Member, IEEE 


Abstract:—Necessity of the advancement of telecommunication 
infrasrtructure is being emphazied according to the rapid 
progress of inovation of telecommunication technology and 
information society. Efficient and cost-effective Access 
Network (AN) evolution to provide seamless service to end 
users through fully interoperable AN is very important to 
achieve the goals of Global Information Infrastructure (GII). 
In this paper, the concept and requirements for broadband 
AN, to upgrade existing AN through interworking and 
intergration are presented. The results a techno-economic 
evaluation of AN implementation models are also shown. 


I. INTRODUCTION 


The existing AN represents a significant asset for 
telecommunication operators and is widely regarded as a 
key enabler for provisioning of the new advanced services 
in the increasingly competitive telecommunications 
environment. Nevertheless, in preparing for an upgrade of 
the very cost sensitive access network segment in a 
bussines/residental broadband market which is 
characterised by a high uncertainty and corresponding high 
risks, the operators are challenged by a series of significant 
strategic decisions with respect to services offer, 
technology choice and how to migrate from the current 
platform towards interactive broadband future; all in view 
of the underlying fundamentals of where and when to invest 
in order to make money with access competition. From that 
reason, in this paper, implementation models of Access 
Networks to support future multimedia services as a part of 
GII are proposed. Also, techno-economic analysis of major 
factors of AN models upgrade is presented. 


Il. ACCESS NETWORK TARGET STRUCTURE 
SURVEY 


The primary purpose of developing present 
telecommunication network is the intention to migrate to a 
network architecture offering all services in a cost-effective 
way over only one access point. Access network 
architecture, as a part of the whole telecommunication 
network, fulfilling such requirements, is shown in Fig. 1. 
The AN supports the transport of information, signalling 
interactive data and control bit streams between the CPE 
and the core network. Representing up to 60% of the 
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infrastructure costs of telecommunication networks access 
technology is the key definer of what services can be 
delivered to the subscriber, making it the focal point in the 
quest for revenue and profit. In 1996 the access network 
accounted for 6 billion USA $ of telecommunication 
business world-wide. It is growing at an average rate of 
approximately 27% per year. By the year 2000-2001 it will 
have a value of about 19 billion USA $. In other words it 
will more than triple in five years [1]. Due to this, it is 
necessary to pay a special attention to the access segment of 
the telecom network in the process of telecommunication 
infrastructure modernization in developing countries. 

The key point of this new network architecture is Access 
Unit (AU). The AU is a generic platform for different types 
of distribution technologies and services and connects the 
AN to the nodes of different core networks (data networks 
or the PSTN). Every model of the AN uses a special AU, 
which is composed of four functional sub-units: distribution 
unit (DU), switching unit (SU), control unit (CU) and 
interface unit (IU) [2]. Such an AN architecture has two 
distinctive features: 

I. A Common Interface between the Distribution Unit 
Offering the Narrowband and Broadband Services and 
the other Units 

2. A Common Control Platform for All Distribution Units 
IU is the common sub-unit for all models and has a task to 
provide all necessary adjustments to connect AU with 
transmission network and it comprises the whole range 
from plesiochronous digital hierarchy to SDH and ATM in 
an integrated manner. 

Progress in technology and evolution of existing specialised 
networks into integrated services network affects the 
upgrading of the switching systems (in construction and 
performances to achieve lower prices). SU is expected to 
provide the application of existing and future switching 
systems in AN in order to simplify design, manufacture 
and, especially, operation. It is divided into two parts: 
narrow band part providing bit rate of n x 64 kbit/s and 
ATM part designed to allow the switching of channels with 
any bit rate. The main use for ATM is seen for interactive 
multimedia applications. 

The CU is intended to support collection, transmission and 
processing of the AN data upon which the decision of the 
AN management can be based aimed at utilising network 
elements to a maximum possible extent. The main 
management functions that CU has to provide are: fault 
management,performance management and configuration. 
The TMN controls the complete infrastructure of the whole 
access zone. Its main function is to provide control and 
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monitoring over different services regardless distribution 
media in a unified manner. Control units are installed in 
each network functional part. A CU in AU is connected to 
TMN via Q; interface with the associate protocol. 


Public Network 


Customer 


Network 
Control 
Functions 
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Control 
Functions RO eae Se ian 


_| Distribution 


--4------- 


Fig. 1. Access network target structure 


The new concept of the AU enables the offering of services 
and Telecommunication Management Network (TMN) 
facilities in a modular way, independent from the 
distribution media. In other words, the concept offers the 
flexibility of changing the distribution media without 
changing the equipment and the TMN operation system. 
Also, a simple expansion is possible in terms of new 
services or in terms of adding customers to existing 
services. These are key advantages due to the changing 
environment and the difficulties to get precise forecasts of 
the future service needs. 

With the consideration that future needs for transmission 
capacity requirements can reach 1 Gbit/s for the size of 
access area in terms of the number exceeding of about 2000 
homes , such a network has to be served by one AU. Due to 
different access models the AU has to be suitable for the 
indoor as well as for outdoor applications. 

Since distribution media, applied technology and type of 
services define a particular model, a separate distribution 
sub-unit (DUj- 1,2. 6) is provided in AU for each of them. 
It must be stressed that the only functional sub-units 
dependent on the applied model are the DU and service 
specific part of PCP. Other functional units in AU are 
general and common for all models. 

A physical interface in a reference point RP] (between 
internal equipment on customer premises and distribution 
media) has to provide terminal equipment connection for 
the following services: CATV, B-ISDN(ATM), N-ISDN 
and PSTN. The NT; = 1,2...6 terminates the AN providing 
different access functions dependent on the distribution 
media used, but it must be service independent. 

In a reference point RP2 type of physical interface is V5.x 
(for services realized using SDH ring structure). At the 
beginning V5.1 interface will play a significant role. Later 
V5.2 interface will be used allowing _ statistical 
concentration within the AN and reducing the transport 
capacity needed for the transmission network. 


Access Network | Network 


III. ACCESS NETWORK IMPLEMENTATION 
MODELS 


New services are a major driving force behind the evolution 
of the existing telecommunication network towards creation 
of the GII, in the part of AN. Due to that reason it is 
necessary to define and develope more alternative models, 
because it is clear that there is no single solution to the 
diverse needs. As a result of analytical researches, six AN 
models, according to the used transmission technology, 
primary oriented towards provision of multimedia services 
for developing countries are defined. 
MODEL 1: Provision services over _ existing 
infrastructure-In the existing networks with infrastructure 
based on metallic cables the customers for telephone, leased 
lines and N-ISDN services are connected to AU through 
DU1. DU1 characteristics must be: 
e based on Recommendations G.702, G.703, G.730, 
G.740 and G.750 series for PDH transmission systems, 
e fulfill Recommendation 1.430 (transmission bit rate 192 
kb/s, line code 2B1Q), for Basic Access and 
Recommendations 1.431 and G.730 (transmission bit 
rate 2048 Mb/s, line code HDB3) for Primary Access 
for ISDN concept. 
MODEL 2: The use of xDSL to provide high-bit-rate over 
copper pairs- Pressing need in communications today is to 
deliver interactive broadband services, such as high-speed 
Internet and video conferencong to small bussines and 
residental customers. A possible way to increase the 
bandwidth and capacity, with the existing infrastructure is 
to use Digital Subsciber Line (xDSL) technology (since 
there are so many version of DSL systems, people have 
used name xDSL,where x stands for any version) through 
DU2. 
ADSL technology enables the distribution of high-speed 
services over the existing twisted pair copper network 
without regenerators. Then in DU2 new modulation (DMT 
- Discrete Multi-Tone, QAM - Quadrature Amplitude 
Modulation, CAP - Carrierless Amplitude Modulation) and 
line code techniques (AMI, 2B1Q, HDB3) are used to 
support asymmetrical transmission between 1.5 Mbit/s 
(max. distance 5.5 km for conductor diameter of 0.5 mm) 
and 6 Mbit/s (max. distance 3.6 km for conductor diameter 
of 0.5 mm) in the downstream direction and upstream 
control channel of 16 to 640 kbit/s [3]. The ADSL will 
mainly be applied to distributive and interactive VoD 
services ( it allows simultaneous viewing of two different 
video channels with MPEG1 compression or one channel of 
NTSC quality video with the forthcoming MPEG2 
standard). 
If VDSL technology is used, DU2 has to be able to carry 
higher bandwidth services (access to Internet and 
multimedia datebases, video distribution,HDTV) up to 50 
Mbit/s (downstream), 1.5 —2.3 Mbit/s (upstream) on short 
lengths (typically a few hundred meters). 
For HDSL technplogy, DU2 has to provide two-way 2.048 
Mbit/s transport over telephony lines up to 3.7 km for 
conductor diametar of 0.5mm without a mid-span repeater 
and up to nearly twice this distance with one mid-span 
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repeater (based on a 2B1Q line code, Recommendation 
G.991.1) [3]. 

MODEL 3: Provision services over fibre cable networks 
using B-ISDN-In cases where there is no infrastructure 
based on copper cables or where investments in new 
infrastructure are economically justified, it is necessary to 
employ optical fibre cables as distribution media. This 
enables the application of Fibre In The Loop (FITL) 
technology (Passive Optical Networks, PON, or active star 
structures). In that case for provision of voice/data/video 
services, principles of B-ISDN are used. DU3 sub-unit is 
designed for such situations. 

For delivery POTS and ISDN services in FITL concept 
PON became the forefront as a solution to the needs of the 
local AN. In that case DU3 has to be capable to receive up 
to 16 x 2,048 Mb/s (480 is max. number of 64 kb/s access 
channels) according to Recommendations G.703, G.704, 
G.732 and support a Byte Transport System operating at 20 
Mb/s, over two single mode optical fibres (operating wave 
length range 1260 -1360 nm, fibre specifications ITU 
G.652). As optical transmitter, due to economical reason, 
Fabry-Perot-Laser (-7dBm sensitivity) and as optical 
receiver PIN diode (-40 dBm sensitivity at BER of 10°, 27 
dB dynamic range) are used. Maximum remote controlled 
CPE distance is 20 km. 

MODEL 4: Hybrid fibre/copper (HFC) access scenario-If 
telecommunication infrastructure is based on optical fibre 
cables, the distribution of interactive video and multimedia 
services is provided by DU4 (configurations HFC and 
FITC, FTTH through Broadband PON-BPON, §are 
acceptable for CATV operators and local exchange carriers, 
respectively). The provision of broadband services requires 
AN enabling bi-directional asymmetrical communication 
of interactive type. For the HFC solution, most used in 
modern AN networks in near future, DU4 consists of a set 
of devices for two-way transmission (an optical transmitter, 
DFB-Laser, for downstream path 47 - 860 MHz and one 
optical receiver, PIN diode, for each upstream path 5 - 30 
MHz). 

MODEL 5: The use of the Wireless Local Loop (WLL)- 
There is a huge world trend of extending the existing and 
istalling new telecommunications systems using the same 
principles as used in cellular networks (modern digital 
techniques, efficient spectrum utilisation). These systems 
are known as WLL. WLL technology allows the new 
alternative service providers to rapidly (within 3-6 months) 
capture residental and small-business customers using fixed 
wireless access technology. It has the potential to support 
mobility and broadband services. Also, it can coexist with 
Personal Communication Systems (PCS) and cellular 
networks and can support Internet service as well.It is 
essenttially a threat to the wireline POTS providers unless 
they also move toward that direction to complement their 
wireline infrastructure.WLL access is made through DUS. 
DUS characteristics depend on: customer mobility and 
density, in selected areas, RF band utilization (depends on 
regional specific regulations), channel capacity for 
transport, customer access method, type of service 
provisioning to the customer, etc. 
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MODEL 6: Access using satellites-If satellites are used for 
distribution of services to a greater number of customers, it 
is necessary to use DU6. Digital technology will be used in 
future, for delivery of voice, data, and video services using 
satellites. The application of digital transmission and Ku 
band makes possible the use of small diameter antennas 
and it eliminates distortions and failures (interference, 
reflection) which are typical for analogue transmission. Due 
to those reasons, DU6 must provide digital processing, 
compression, multiplexing, encoding and QPSK 
(Quaternary PSK) or QAM modulation. 


IV. TECHNO-ECONOMIC EVALUATION OF THE 
MODELS 


The AN is the most cost demanding part of a 
telecommunications network and in the same time the one 
most sensitive to the regulation and competition. In a 
competitive environment, network infrastructure upgrades 
must be economically, that is cost effective: new 
investments must generate positive returns, possible in a 
short period. 
The introduction of a new technology or the upgrade of an 
existing one in the AN is an expensive project for any 
telecom operator, especially for newcomers and therefore 
introduction and implementation strategies are required to 
evaluate the best part to realise the new infrastructure. The 
techno-economic evaluation is proces in which three phases 
can be identified: 
e definition of the models to be evaluated 
e evaluation of the defined models, using an adequate 
SW tool 
e analysis of the results and the production of the outputs 
A model is the description of a network situation to provide 
a given set of services to a number of customers within a 
certain area and study period. For every model following 
parameters are defined [7,8]: 


Ca 
Fig. 2. Cost survey for Models 1-6 (urban) 
1. AN architecture description — the network elements are 
characterised by: 


e prices 
¢ parameters characterising the price evolution 


¢ parameters characterising the OA&M _ costs 
induced by the network elemenst 


¢ parameters describing the system modularity and 
network parameter modularity 
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2. Service penetration over the study period 

3. Set of services — in this techno-economic evaluation 
fixed services (POTS, ISDN, 2M ASB, 2M SSB, 8M 
ASB/SSB) up to 8 Mbit/s and mobile services (POTS, 
ISDN, 2M ASB) are analysed. Definition of revenues 
and tariffs for each service is proposed. 

4. Area definition — three types of area have been defined: 

downtown, urban and rural with a set of attributes to 

characterise them (predefined number of access areas 

described in terms of their radius, living units density 

and duct availability) 

Proces of structural reforms 

6. Study period — set of seven years, from 2000 — 2006 
have been specified 


nN 


erences 


A 
So 


= 
ECu) 


Profit (mitt, 


Fig.4. Profit survey for Models 1-6 (urban) 


After that, economic calculation was performed using 
originaly developed SW tool. ]9]. Main financial figures 
(cost, income, profit, net presen value — NPV) as a result of 
the calculation for urban area are presented in Figures 2-5 
respectively.Downtown and rural area results of the 
calculation are presented in [9]. 


V. CONCLUSIONS 


Advantages of the proposed AN architecture are: customers 
have access to the network providing full variety of 
services, customers are in a position to define and control 
services by themselves in a supportive environment using 


network capacity according to their needs, network will be 
significantly simplified (flexibility in service upgrades and 
access planned resource optimisation). 


Fig. 5. NPV survey for Models 1-6 (urban) 


According to the techno-economical evaluation, several 

conclusions can be pointed: 

e the cost of bandwidth in the AN is still very significant 
the significantly lower costs of ADSL/VDSL upgrades 
compared to FITL AN solution make this solutions the 
preferred wireline technology choice in the short term 

e HFC technology is an economically attractive solution if 
the public network operator has a CATV already 
available, but this is no case in developing countries 

e the AU configuration and corresponding degree of fibre 
penetration is a key strategic decision 

e wireless technology becoming more and more an 
alternative for providing multimedia services because it 
minimizes the problem of uncertainty in forecasting 
subscribers number, investment can be made when lines 
are required, can be used from low density areas to high 
density areas, from the fixed applications to mobile 
permitting fast network deployment (very important for 
developing countries without cable infrastructure) 
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Modeling the Structural Return Loss in Twisted 
Pair Cables 


Anton Umek, Member, IEEE 


Abstract— The paper presents the results of a computer 
simulation for non-uniform twisted pair. The non-uniform 
transmission line is modeled as a chain of homogeneous line 
segments, where equivalent linear two-port circuit replaces 
each homogeneous line segment. The accuracy of the seg- 
mental model can be made good enough by choosing short 
segments relative to signal wavelength. Examining the in- 
fluence of only one parameter can reduce the model com- 
plexity. A computer simulation has been carried out for 
periodic and random changes of twisted pair wire distance. 


Keywords— non-homogenous transmission lines, UTP, 


I. INTRODUCTION 


Transmission of digital signals on a short, two-wire line 
can take a very wide frequency band. Therefore, the impact 
of non-homogeneity on the echo and transmission proper- 
ties of a two-wire line has to be researched. In specialist 
literature, results of research carried out on the effects of 
non-homogeneity refer only to coaxial lines [1], {2]. This 
paper therefore presents an approach, which enables the 
analysis of the effects of non-homogeneity in the structure 
of two-wire lines. 

Besides the results provided by an accurate computer 
supported model for the periodically non-homogenous 
structure of a two-wire line, we could also find reliable ana- 
lytical approximations. The treatment of two special cases 
of a periodically non-homogenous structure with a rectan- 
gular and a sine profile primarily enables us to illustrate 
explanations of phenomena observed on a non-homogenous 
two-wire line. For a randomly non-homogenous two-wire 
line, an analysis of the probability distribution of the re- 
flected signal is also given. 


II. A MODEL OF LINE PARAMETERS 


A cross-section for homogeneous two-wire line is illus- 
trated in Figure 1. The dimensions in the section of an 
actual two-wire line differ from the designed dimensions 
for technical reasons. Exact modeling of non-homogenous 
two-wire lines would be too extensive due to the high num- 
ber of parameters. Therefore, an approximation model is 
used in which only the effects of changes in the wire dis- 
tance a(x) are observed: 

a(x) = @(1+ 6,(z)) (1) 
where 6,(z) is a relative wire-distance deviation. For small 
variations we can use approximate equations for primary 
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Fig. 1. Unshielded twisted-pair cross-section. 


line parameters: 


Clb.) = (2) 
G(ba) = Ss (3) 
LE (5a) = L(1 + 6a) (4) 


where C, G and L are primary parameters for a homo- 
geneous twisted-pair with wire-distance a. Approximation 
error in (2),(3) and (4) is less than 3% if 5, < 0.1. 
Non-homogenous two-wire line is substituted by an ap- 
proximation, represented by a chain of short homogenous 
segments with characteristic impedance Zp and propaga- 
tion constant y as shown in Figure 2. Each homogeneous 


ee 


Z,(%), ¥(x) 
0 bx (n-1) Ax nAx (n+l)Ax  (N-1) Ax NaAx x 
t ——— ee +— --—+—— + 
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: 2,7 Ze y? : ze Dy : ize dy Y 


Fig. 2. Transmission line segmentation. 


segment is modeled with equivalent two-port ABCD para- 
meters [3]: 
T= Ay BW XS) )cosh(7Ax) 
ie ~ | gsinh(yAz) 


Zo sinh(yAz) 


C yD cosh(yAz) (5) 


The quality of the two-port model depends on the accuracy 
of primary line parameters. In the model of line parame- 
ters the impact of insulator dielectric losses and skin effect 
[4] should be taken into account. An equivalent two-port 
matrix of a cascade is computed simply by multiplying the 
ABCD matrices for each section: 


N 
r= fl Tn (6) 
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The procedure is suitable for numeric analysis with com- 
puter support only. ?. 


III. STRUCTURES WITH PERIODIC IMPEDANCE PROFILE 


The characteristic impedance profile can have a periodic 
nature due to manufacturing process faults [5]. A pe- 
riodic impedance irregularities cause periodic reflections, 
which can add together with equal phases at line input. 
Even small deviations in characteristic impedance profile 
can therefore cause serious deviations in input impedance 
diagram. 


a(x) 
A 


= 


ae =e 


+ 
() X=K Ax X=N x, 


Fig. 3. Periodic wire-distance profiles: rectangular and sinusoidal. 


A. Approximate analytic solutions for rectangular periodic 
impedance profile 


The situation on line with periodic rectangular im- 
pedance profile can be analyzed by the method of traveling 
waves. The voltage reflection coefficient between adjacent 
line segments is: 


(7) 


where Z; and Z are the characteristic impedances of the 
adjacent segments. The conditions on line input and line 
output are illustrated in Figure 4. From an infinite number 
of reflected waves on the line input, it will be precise enough 
to consider only a finite group (2) of only once reflected 
waves. The variations in propagation constants will also 
be neglected: 71,72 * y. The total back-scattered signal 
is: 
2N 
Ureg = Uo(1+T)T S\(-1)Ftte7k 120 
k=1 


(8) 


Maximal values of reflected wave amplitude on line input 


will occur at the fundamental resonant frequency and its 
multiples: 


U 


fn Dao 


(2n + 1), where n = 0,1,2.. (9) 


where v is a phase velocity (approx. 66% co). 


1 All calculation has been made with MathCAD. 
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Fig. 4. A simplified travelling wave analysis for a rectangular im- 
pedance profile. 


The Structural Return Loss (SRL) expressed in decibels 
is defined as: 


Wire 
SRL = 20 log|—£| (10) 
Uo 


Figure 5 shows the results obtained by an accurate com- 
puter supported segmental model for a standard unshielded 
twisted pair (26AWG, CAT5). 
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Fig. 5. Structural return loss for a rectangular impedance profile 
with parameters da = 0.016, zo = 5m, N = 40. 


Approximate solution for SRL can be expressed from 
(8). The minimal SRL occurs at the fundamental resonant 
frequency fo: 


1 — Ag” 
eo 1 
Bae PAL (11) 


where Apo is the absolute value of the zo long line segment 
transfer function: 


SRL(fo) = —20 log |ba Ao 


Ao = e (fo) xo (12) 
The equation (11) can be further simplified for a very long 
periodic structure N — oo, if we use the f” model for line 
attenuation a(f): 


a(f) + Kaf” (13) 


10” Mediterranean Electrotechnical Conference, MEleCon 2000, Vol. I 


where a model parameter n is typically between 0.5 and 


0.6. : 
26a fo ba 
Ks | (14) 


For minor deviations 6 the findings of approximate analyt- 
ical model (14) differ very little from the results provided 
by an accurate computer supported segmental model. 


SRL(fo) © —20 log | 


B. Structures with sinusoidal profile 


Let us suppose that the characteristic impedance of short 
segments is real and linearly proportional to wire-distance 
deviations: 

Zo = Zo(@)(1 + da) (15) 


The reflection coefficient value is therefore real and neg- 
ative proportional to the gradient of wire-distance devia- 
tion on each point along the line. Transmission line loss 
rise exponentially with distance and the reflected impulse 
response is therefore an exponentially weighted sinusoidal 
wave as illustrated in Figure 6. The path of the total re- 
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Fig. 6. Top: Hypothetical testing conditions. Bottom: Reflected 
signal on twisted pair with periodic sinusoidal wire-distance de- 
viation. 


flected wave can be approximated with a single resonator 
transfer function H,(f) only if the number of periods N is 
sufficiently large: 


Ho 


= 1 aD Pig 


H,(f) (16) 


The calculated resonator parameters for the case 6, = 1%, 
Zo = 2m are: fo = 54MHz, Ho = 0.25, Q = 50. Resonator 
gain Ho and quality Q can be extrapolated with empiric 
equations: 


Ho (6a, fo) = Hola fa) 2 ey (17) 


Qlfo) = QA) (By (18) 
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where n is the best-fit exponent in f” line attenuation 
model (13). 

The minimal return loss occurs only at one fundamental 
frequency fo: 


SRL(fo) = —20log Ho (19) 


The results of computer simulation are shown in Figure 7. 
Exact course of SRL(N) asymptotically approaches with 


Fig. 7. Signal return loss at fundamental frequency fo. 


N its minimal value (19) at which the aforementioned ” res- 
onator model” is valid. In all cases the boundary value for 
N is between 10 and 100. 

The reflected signal path for any other periodic im- 
pedance profile can be modeled as a group of resonators 
connected in parallel. As an example of such decomposi- 
tion we can use the rectangular profile: 


co 


x 4 
a(x) =a(1+da—)) 


k=1 


Seq sin(2m(2k - 1))) (20) 


From (11) and (20) we conclude, that the difference in re- 

flected signal amplitude at fo for a rectangular and sinu- 

soidal profile with the same deviation 6, is about 2dB. 
IV. RANDOM IMPEDANCE PROFILE 


Impedance discontinuities can also be randomly distrib- 
uted, as for wire-distance profile shown in Figure 8. The 


P;(6,) 


Fig. 8. Random wire-distance profile. 


reflected waves on line input have random phases and ran- 
dom amplitudes for all frequencies. Real and imaginary 
components of the sum of all reflected waves are mutual 
independent Gaussian random variables. Both variables 
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have equal variances and zero means. The magnitude of 
the total reflected wave has therefore a Rayleigh proba- 
bility density distribution. The frequency dependence of 
SRL for a random profile can be calculated numerically by 
computer. 


A. Computer simulation 


Transmission line with a random wire-distance profile is 
modeled as a chain of short homogeneous line segments as 
shown in Figure 2. The random profile 6,(nAz) is a sum 
of J > 2N sinusoidal components with random phases: 


I 
da(nAz) = 5,[n] = 5 ie So sin(ni = +6{t]) (21) 
i=1 


All spectral components of the sampled function 4(z) are 
selected to have equal amplitudes. At a given signal wave- 
length, the influence of all profile harmonic components 
with a period less than 4/4 is practically negligible. The 
frequency range of interest should be therefore limited to 


Vines: r ij 
ee ee = 29 
rnin 8Axz ( ) 


The results of computer simulations in 100 frequency 
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Fig. 9. SRL on 100m long random non-homogeneous twisted pair 
with parameters: o5 = 4%, Az = 10cm. 


points for 1000 randomly generated wire-distance profiles 
are shown in Figure 9. The attenuation of the mean value 
of reflected signal around a given point SRL(f, = 50Mhz) 
can be approximated with an equation: 


SRL(f) » SRL(fi) — 14 log + (23) 
Besides the mean value of reflected signal, a curve for the 
maximal reflected signal for all 1000 generated cases is also 
shown in Figure 9. A dashed curve with a 9dB offset below 
the SRL(f) represents a border with probability 99.98%. 
The probability density distribution of the reflected signal 
is shown in Figure 10. A histogram at f = 100MHz for 


all n = 1000 profiles is compared with an exact Rayleigh 
distribution function. 
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Fig. 10. Probability distribution of reflected wave amplitude at fre- 
quency 100MHz, SRL = 19.4dB 


V. CONCLUSIONS 


The results of comprehensive computer simulations in- 
dicate that random structural changes along the two-wire 
line have an impact primarily on the course of the input 
impedance. The differences in the attenuation of the trans- 
mitted signal are insignificant. The undetermined nature of 
the input’s impedance frequency course is expressed with 
SRL(f) , which in a simultaneous two-way transmission 
represents a significant parameter for the design of the echo 
canceller. The results of computer simulations are based 
on the assumption that the random variable 6,(z) has a 
uniform power spectrum. However, it seems more likely 
that the power spectrum is a declining function. In this 
case the SRL(f) would fall with frequency for less than 
14dB per decade. 
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Abstract — The expression for the radiation 
impedance of a half wave length wire dipole an- 
tenna is evaluated by using plane wave expan- 
sion. Usually the impedance of a wire antenna 
is expressed with cosine and sine integral func- 
tions. In this study an alternative approach is 
done which leads to the expression for the ra- 
diation impedance which contains Bessel func- 
tions. 


I. INTRODUCTION 


The radiation impedance of a half wave length 
dipole antenna is usually presented in terms of cosine 
and sine integrals [1], [2], which are special functions 
tabulated in literature [3]. However, cosine and sine in- 
tegral functions may not be familiar to engineers who 
are not specialised to antenna theory and advanced 
field theory. In this study the values of the radiation re- 
sistance and reactance of a half wave length wire dipole 
antenna are given in terms of Bessel functions of the 
first kind. The solution-is an infinite series of Bessel 
functions with even index for resistance part and with 
odd index for reactance part. This method can also 
be generalised to consider wire antennas whose lengths 
are even multiples of the half wave length. 

In this procedure the expression of the antenna 
impedance is developed such that the part which de- 
pends on the antenna length is separated from the part 
which depends on the angle 6. The part which depends 
on the antenna length is written with Bessel func- 
tions and the angle dependence is presented in terms 
of Chebyshev polynomials of the second type [4]. The 
angle dependence is integrated thus giving coefficients 
associated to the corresponding Bessel functions. The 
integration of the Chebysev polynomials can be easily 
done. The impedance value obtained in this study is 
compared to the known value found in literature. 


II. RADIATION RESISTANCE 


As an introduction let us start by considering fields 
radiated from a line source 
J=u,Isin[k(h—|z|)], -h<z<h (1) 


The electric and magnetic fields are in far zone 


is the field pattern function of a wire dipole antenna [1]. 
The angle 6 denotes the angle between z-axis and the 
field point. The radiation resistance is defined through 
the radiated power 


1 
pe 5 | Re(ExH"}-ds (4) 
2 Js 
which is equal to the power delivered from the antenna 
circuit i 
Pe 5 RT? (5) 


Inserting the expressions for fields and using the surface 
element dS = u,r? sin@d@dy the following expression 
for the radiation resistance is obtained 


us 


ba / F?(0) sin 0d0 (6) 
27 
0 


where 7 = \/Uo/€o = 376.7 2 is the value of the wave 
impedance in free space. 

Now the length of the wire antenna is a half wave 
length (2h = A/2), that is, the parameter kh = 5. The 
field pattern function reduces to the form 


cos (5 cos @) 


Oy) — (7) 


sin 6 
and the radiation resistance is 


n / cos? ( cos 6) 


= Qn sin 6 
0 


dé (8) 


After the change of variable [1] 
u=cos@, du=-—sin6d@ (9) 


the expression for the radiation resistance is trans- 
formed to the form 


2(2 
pap f SP au (10) 
1— vu? 

=i 


After that decomposing the denominator of the inte- 
grand into two parts 


1 1 1 1 
a 11 
1-w? s(aeetica) BD 


— INL F (9) 0 jkr RE are 
E= Caan aaa H= a 8 Wy (2) 
and using again the change of variable 
where a ‘ 
FB ee Oe (3) ee ees (12) 
sin 9 T I 
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for the first part in (11) and 

U Ul 

- =1-u, ig EET fs (13) 

1 T 
for the second part in (11) and using the symmetry 
property of the integrand which combines the variables 
v and v' together as presented in [1] the expression 
for the radiation resistance is finally obtained in the 
following form 

2a 


R= 2 fa (14) 


An Uv 
0 
This can be expressed exactly with the cosine integral 
function Ci(z), a special function tabulated in litera- 
ture [3] 


x 


[Ss =7+Inz — Ci(z) (15) 
v 
0 
where y = 0.5772 is the Euler-Mascheroni constant [4]. 


III. RADIATION IMPEDANCE 


From the expression for the radiated power only 
the radiation resistance, that is, the real part of the 
radiation impedance is obtained. There is also a re- 
actance part in the radiation impedance which comes 
from the effects of the oscillating fields (reactive fields) 
of a dipole antenna. These are near fields which vanish 
in far zone. However, the expression for the radiation 
impedance of a half wave length dipole antenna is ob- 
tained by using the method of induced electromotive 
force which results in [1] 


20 


Uy] 
Z=— 
Ar 
0 


1 —cosv _sinv 
Vv 


du (16) 


Similarly, the imaginary part (radiation reactance) is 
usually written with sine integral function [3] 


is Se Sha) (17) 


and the exact expression for the radiation impedance 
of a half wave length dipole antenna is 


= rede +In2n-Ci(2n)+jSi(2m)] (18) 


Instead of using the sine and cosine integral functions 
another approach is done here. 

Applying the similar change of variable as in the 
previous section the radiation impedance (16) is trans- 
formed to the form 


id i 1+cos(mcos@) sin (cos@) cos 
iG =i) sin 6 cas sin 6 | 
0 


(19) 
Let us write the impedance in terms of a real part and 
an imaginary part as 


Z=R+jXx (20) 


and consider them separately. 


A. Radiation resistance 


The real part of the radiation impedance is 


f 1 +cos (m cos 9) 40 


=i sin 6 
0 


(21) 


Using the expansion given in [3] 


cos (x cos@) = Jo 421 )* Joy, (x) cos (2k0) (22) 


the real part is 


© 1+ Jo(m) +2 3 (—1)™Jom(m) cos (2m#) 
m=1 


=— dé 
Ar sin 0 
0 
(23) 
Writing cos (2m6) = 1 — 2sin? (m@) and noticing that 
cost = Jo(m) +2 S> (=1)"Jom(m) (24) 
m=1 


the radiation resistance is —_ 


sin 8 


Se) wal “s 3 f sin? (mé) 
R= — FAY (ann / sin’ (nO) iy) a) 


Using the new summation index m + m+ 1 the real 
part of the radiation impedance is finally 


RS 2S anal | feet dé 


sin 8 
(26) 
B. Radiation reactance 


The imaginary part of the radiation impedance is 
“ora f sin (1 cos 8) cos 8 

~ An / sin 0 a a) 
0 


Using the expansion [3] 


[o) 
sin{a:cos.@) = 2 So (-1)* Jans (x) cos [(2k + 1)6] 
k=0 
(28) 
the imaginary part is 


Xe 
T a See 
i. [ 5 (—1)™Jom+1 (7) cos [(2m + 1)6] cos op 
0 


fra sin 6 


(29) 
Writing 
cos [(2m@ + 1)6] = cos (2m6) cos — sin (2mé) sin 8 
(30) 


EY. i 0 Pe [ype =r pewennrenecwerpeeri-—peer se 
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the angle dependent term inside the summation can be 
written in the following form 


cos [(2m + 1)6] cos@ = 1 — sin? m@ — sin? [(m + 1)6] 
(31) 
Inserting this expression into the summation and notic- 


ing that 
Sina —=12 ae 


the expression for the oe reactance is finally ob- 
tained 


ie i sin? mé 

ee AL oa ™m 

cpanel! Jamaa(a) | | aS 
m=0 0 


sin? [(m + 1)6] 
sin 8 | a 


JP Samii) (32) 


(33) 


In these expressions the part which depends on the an- 
tenna length is separated from the angle dependent 
part. The integration is restricted only to the angle 
dependent part which can be easily done. 


IV. COMPUTATION OF IMPEDANCE 


In this section the integration is done which leads 
finally to the value of the radiation impedance. The 
expressions inside the square brackets in (26) and (33) 
can be written in terms of Chebyshev polynomials of a 
second type [4] 


sin [(m + 1)9] 


Uj, (cos@) = ara 


(34) 


The integration is done which results in coefficients for 
the respective Bessel functions. These coefficients are 


An = | 2%(c0s8) sin 0d (35) 


and using a new variable z = cos@ the coefficients are 
obtained simpler in the form 


De [UR@e: = 2 f vr eyde (36) 
0 


The Chebyshev polynomials of the second type are 
given in literature [4], examples of some lowest order 
ones are 


Uo(z) = 1, U,(z) = 22, U2(z) = 4a? - VF 
Us(a) = 82° 42, U,(z) = 162* —122?+1, _... (37) 

There is a recurrence relation for the polynomials 
Un+i(£) = 2£Un(x) — Un-i (2) (38) 


from which all other polynomials can be evaluated 
when the two first ones are known [4]. After calculating 
the integrals the values of the coefficients are obtained 


8 _ 46 


Api = 2, Ay = —. =—, 
e aon tie Pres 


352 _ 3378 
Lhe ty pal Beary 
Now the radiation resistance and reactance can be 


written in terms of Bessel functions. The radiation 
resistance is 


3= (39) 


R=2 0-1) Andomiatn)\ (40) 


m=0 


and the radiation reactance is 


(-1)™[Am-1 + Am]J2m+i(7) (41) 


m=0 


In these expressions the coefficients A,, are given in 
éq17 and the coefficient A_; = 0. 

The Bessel functions J,(z) follow the recurrence re- 
lation [4] 


ee = Jn(c) — Jn—1() (42) 


from which the values for higher order functions can be 
easily calculated when the values for the two lowest or- 
der functions are known. The values for the two lowest 
ones are 


Jo(m) = —0.304242, J, (m7) = 0.284615 (43) 
From these values and using the recurrence relation 
(42) and the values for the coefficients (39) one obtains 
finally the value for the radiation impedance 


Z 73.122 + 742.558 (44) 


In this case only four components in the summation 
is needed since the values of the Bessel functions are 
converging very fast with high index values. The value 
(44) is exactly the same as found in literature [1]. 


V. CONCLUSION 


The impedance of a radiating half wave length 
dipole antenna is given in terms of Bessel functions. In 
this study the parts which contain the antenna length 
and the angle dependent parts are separated. The angle 
dependent parts are written in terms of the Chebyshev 
polynomials of the second type which after integration 
gives us the coefficients for the corresponding Bessel 
functions. The integration of the Chebyshev polyno- 
mials can be easily done. Because the values of the 
Bessel functions converge rapidly with high index val- 
ues only a couple of terms are needed in the series. For 
example, for a half wave length dipole antenna four 
terms in the series are enough to obtain the value of 
the impedance in accuracy of three decimals. 

This kind of study can be generalised to calculate 
the antenna impedance of wire antennas whose lengths 
are even multiples of the half wave length. Also this 
kind of Bessel function expansion may be useful in an- 
tenna synthesis to calculate current distribution for the 
required radiation pattern since the Bessel functions as 
well as the Chebyshev polynomials obey certain orthog- 
onality relations. 
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A new structure for a six port reflectometer 
by using a five port ring 


C. Ciminelli, A. D'Orazio, M. De Sario, V. Petruzzelli, F. Prudenzano 


Abstract— This paper presents a new structure for a Six-Port 
Reflectometer (SPR) which uses a symmetric five-port ring 
cascade-connected to a backward coupler. The performance 
of the ring-based SPR has been compared with that of a SPR 
which uses four 90° hybrids. Circuit design criteria, developed 
by means of commercial computer code Advanced Design 
System (ADS), show that the ring configuration allows to 
obtain a higher cut frequency and lower attenuation 
coefficients. 


Index terms—Six-port reflectometer. 


I. INTRODUCTION 


The six-port reflectometer (SPR) is a passive linear 
device which measures the complex reflection 
coefficient [ for a Device Under Test (DUT). Since 
the idea of SPR was proposed [1], it became an 
alternate approach to measure the complex ratio 
between two signals other than directly using a 
vectorial network analyzer. As the SPR structure is 
usually much simpler than that of a conventional 
network analyzer, it is a good candidate for 
integration. In a SPR, the amplitude and phase of 
complex IT is calculated from the measurement of 
power values at four output ports. 

The calibration procedure [2] basically consists of two 
steps. In the first step, the SPR is transformed into an 
equivalent four-port reflectometer: this requires the 
determination of five parameters to characterize the 
SPR. The next step determines the six real (or three 
complex) parameters to characterize the equivalent 
four-port reflectometer. 

There have already been a few approaches to realize a 
six-port reflectometer in MMIC technology. One 
possibility consists of using traditional structures and 
replacing transmission lines which would be too long 
for integration by equivalent lumped-element 
circuits [3]. In general, this approach gives rise to a 


Another possibility consists of using an essentially 
resistive structure: the resulting circuit operates over a 
larger bandwidth but shows considerable losses. In [4] 
a SPR which uses nonmatched diode detectors having 
a high input impedance, placed around a phase shifter 
in conjunction with a power divider for the reference 
detector, has been presented. A good largely 
employed configuration of SPR, which is derived 
from the original design of Engen [5], consists 
coupler, power dividers and 180° hybrids. 


In this paper, we present a new structure of a SPR 
which uses a microstrip five-port ring in cascade to a 
backward coupler. The performance of the ring-based 
SPR is evaluated by means of computer code 
Advanced Design System (ADS) from HP, and is 
compared with that of a SPR which uses four 
quadrature hybrids. 


Il. THE SIX PORT EQUATIONS 


SPR's are linear devices which are designed in a way 
that the power P3 detected at the so-called reference 
port depends in the ideal case only on the power wave 
b2 emerging from the measurement port which is 
proportional to the power provided by the source 
connected to port 1. The powers P4 to P6, at the 
remaining ports 4 to 6, of three different linear 
combinations of the wave bz and the wave a2? which is 
incident on this port, are measured. Let be Aj and Bj (i 
= 4, 5, 6) the coefficients of these linear combinations 
and B3 the proportionality factor between b2 and the 
wave measured by the reference detector at port 3, the 
following equations for the power ratios Pj/P3 are 
obtained: 


3 : : 2 
rather narrow operating bandwidth of the device. Poo |Ajaz + Bibo| _ Serer, (1) 
Ta 2 i tosis 
P3 [Baba] 
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By defining the reflection coefficient of DUT 
connected to port 2 of the SPR: 


p==2 Q) 
bo 
and introducing the quantities qj: 
B; 
i= (3) 
ick 


the equation (1) can be written in the form [1]: 


B47 
Aj 


f¥ i=4, 5,6 (4) 


ir -ail* = 


T is given by the intersection of three circles, the 
centers of which are the points qj and the radius is 
proportional to the ratios Pj/P3 of the powers 
measured by the detectors. An accurate determination 
of I requires that the points qj do not approach each 
other and are symmetrically distributed around the 
origin, i.e. separated by 120° at the vertices of an 
equilater triangle. 


Il. CALIBRATION TECHNIQUE 


In a six-port reflectometer, the amplitude and the 
phase of complex reflection coefficient I are 
calculated from the measurement of power at four 
output ports. The calibration procedure of a SPR is 
quite complex and the calculation of I from the 
measured powers requires a lot of computational 
efforts. 


The calibration procedure suggested by Engen in [6] 
consists of two main steps which allow to determine, 
in the first step. a virtual reflection coefficient W and, 
after, the effective [. In the first step, the SPR is 
transformed into an equivalent four-port reflectometer. 
This requires the determination of five parameters to 
characterize the SPR. The second step consists of the 
bilinear transformation W- IT which allows to 
determine the six real (or three complex) parameters 
necessary to characterize the equivalent four-port 
reflectometer. These parameters can be obtained by 
using different types of termination at the measure port 
and collecting the power readings of the detectors. To 
fulfill the requirements of the two calibration steps, in 
this paper, we refer to a sliding short. In the complex 
I’ plane, the locus of I’ values for the sliding short is a 
circle centered in the origin and of a nominal-unit 
radius. Therefore, by considering the well-known 


property of any bilinear transformation for which 
circles are mapped into circles, the locus in the W 
plane associated with the sliding short is a circle, too. 
In this way it is possible to derive an equation in 
closed form in five unknowns. Substtution of the 
measured power values corresponding to five sliding- 
short positions leads to a system of linear equations 
which may be easily solved. 


TV. RING -BASED SPR 


The configuration of SPR, shown in Fig. l, is 
constituted mainly by a five-port symmetric ring and a 
backward coupler, realized in microstrip technology. 
The construction conditions are: matching at six 
ports; 2) isolation between the measure port 2 and the 
reference port 3 where signal is proportional to the 
input one; 3) the transmission from the measure port 2 
to the ports 4, 5, 6 (and not towards the port 3 where 
there is the reference power detector). The substrate 
characteristics, common to all simulation, are: height 
of dielectric H = 0.508 mm, relative dielectric constant 
€ = 3.38, microstrip thickness T = 2 yum, loss factor tg 
5 = 0.002. The metal has been considered perfect. 


The ring has been designed by means of five circular 
patches having width we = 1.3 mm, radius R = 4.12 
mm, a = 72 ° and five equal lines having width wi = 
1.1 mm, length Lj = 2.5 mm while the backward 
coupler has the following parameters: w =1.17 mm, 
gap between lines s =0.25 mm, L = 2.5 mm. 


The evaluated reflection coefficients of the ring- based 
SPR at 7.8 GHz are: S;; = -33.97 dB, So2 = -47.89 
dB, S33 = -33.61 dB, S44 = -43.12 dB, S55 = -47.89 
dB, Seg = -43.12 dB. All these values show a good 
matching at the six ports. 

The evaluated transmission coefficients at the same 
frequency from the input port | to the others are: S$}? 


Fig. 1: Lay-out of the ring-based SPR. 
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-6.16 dB, S}3 = -18:.47 dB, S)4 = -6.1 dB, Sj5 = -6.16 
dB, S16 = -6.1 dB while the transmission coefficients 
from the measure port 2 to the others are: S23 = -37.5 
dB, S24 = -6.015 dB, S25 = -6.12 dB, S26 = -5.99 dB. 
Besides notice the uniformity of the attenuation of the 
waves, equal to about -6 dB, it is important to outline 
the good isolation between the port 2 and 3 expressed 
by the very low value of the coefficient S23 = -37.5 


\ 


Fig.2: Modulus (a) and phase (b) of the centers qu, q5, 
d6- 


Fig.2 shows the modulus and the phase of the centers 
qi evaluated as a ratio of S parameters. In particular, it 
results: q4 = - S41/S42S21, qs = - S51/S52S21, 96 
= - §6)/S62S21._ At 7.8 GHz, the moduli assume the 
value of about 2 while the phase differences between 
two adjacent centers are equal to 120°, as requested. 


V. HYBRID-BASED SPR 


The configuration of this type of SPR, shown in Fig. 
3, includes five quadrature hybrids, a 3 dB attenuator 


Fig.3: Lay-out of the hybrids-based SPR 


and two matching loads, realized in microstrip 
technology. The quadrature hybrid is designed with 
reference to a circular structure with four ports having 
simply matching lines and two short-end stubs. 


The geometrical parameters are: a= 96°, B = 42°, 
width and radius of curvature of circular patches we = 
1.304 mm, R = 3.08 mm, short-end width and length 


j 8 7.0 %F7.P 7.4 7.8 Va ae a.f a4 ao 88 


trea, GHz 


{a} 


Fig.4: Modulus (a) and phase (b) of the centers qq, qs, 
q6- 
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Wo = 1.304 mm, do = 2.5 mm, matching lines width 
and length ws = 1.304 mm, ds =2.5 mm, while the 3 
dB attenuator has been realized by a T resistors 
junction with resistences equal to 8.58 Q, 8.58 Q, 
141.42 Q. 


The evaluated reflection coefficients of the ring- based 
SPR at 7.8 GHz are Sj; = -47.15 dB, S22 = -50.71 dB, 
S33 = -48.55 dB, S44 = -6.27 dB, S55 = -9.4 dB, See = 
-9.35 dB which show a good matching only at the 
ports 1, 2, 3. 


The evaluated transmission coefficients at the same 
frequency from the input port | to the others are: $}2 = 
-6.58 dB, S}3 = -9.19 dB, S)4 = -9.17 dB, Sis =- 
925 dB. Sig = -9.2 dB while the transmission 
coefficients from the measure port 2 to the others are: 
S23 = -64.15 dB, S24 = -9.13 dB, S25 = -12.2 dB, S26 
= -12.2 dB. The isolation between the port 2 and 3 is 
now better being S23 = -64.15 dB. 


Fig.4 illustrates the modulus and the phase of the 
centers qi; it shows that, at 7.8 GHz, the circle centers 
are not at the vertices of an equilateral triangle, even if 
their reciprocal distance assures a securely good 
bandwidth. 


VI. COMPARISON 


The performance of two SPR configurations over a 
bandwidth of 20 GHz has been simulated. We have 
verified the characteristics of the ring-based SPR are 
securely more uniform with respect to those of the 
hybrid-based SPR. In particular, for the ring based 
SPR, the S)3 parameter assumes values in the range of 
design limits, for frequencies greater than 6 GHz, and 
exhibits the minimum attenuation at about 16 GHz. 


The parameters S}2 and S)5 and the parameters S>4 
and S95 have the same shape and show attenuation 
less than 10 dB till 10 GHz. The Sj5 parameter has 
attenuation less than 10 dB till 16 GHz. The S)3 
isolation coefficient shows the desired behaviour in the 
range 1-7 GHz, decreasing from the value of -56.24 


dB to the value of -38.945 dB which remain constant 
till to 15 GHz. 


For the hybrid-based SPR, the S 3 coefficient shows 
three negative peaks equal to -40dB till to 18 GHz. At 
the high frequencies, the isolation coefficient is lower 
than that of the ring-based SPR and has values in the 


range -40 -20 dB. With reference to the other 
transmission coefficients, the attenuation is about 20 
dB in the whole range except for frequencies near 8 
Hz. 


From these considerations, we can conclude that the 
ring-based SPR exhibits a greater cut-off frequency 
equal to about 14 GHz. 
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Influence of Shallow Impurity on 
Steady—State Probability Function 
of Multilevel Deep Impurity: 


Julijana Divkovic Puksec' and Vera Gradisnik’, Member, IEEE 


Abstract - The deep impurity add into the n- or p-type of a 
semiconductor is partially ionised. The probability function used 
to describe the occupation of a deep energy level, is the Fermi- 
Dirac function into which the entropy factor is introduced; Xp 
for donor level or x, for acceptor level. The entropy factors are 
used to adjust the calculated and measured values. An effective 
deep energy level was defined, depending on the predicted 
position of a deep level and on obtained entropy factor. 
Comparing the calculated and measured values for gold and 
platinum added into the n- and p- type of silicon, we can see that 
the same predicted energy level is described with the quite 
different entropy factor in the n- and p- type of a semiconductor. 
According to the obtained positions of the effective deep energy 
levels, it can be concluded that in the compensation between 
shallow and deep impurity a deep level, which is nearest to the 
shallow level, must be considered. The other levels are neutral. It 
might happen that in the n-type of semiconductor the higher 
acceptor level of platinum is occupied, while the lower one is 
empty. It seems that such a neutral energy level does not exist in 
the n-type, while in the p-type it does, and it is partially occupied. 


Index Terms- multilevel deep impurity, entropy factor, deep 
energy level, probability function. 


I. INTRODUCTION 


In general, the deep impurity can introduce more than one 
deep energy level into the semiconductor band gap. 
Theoretically, the monovalent deep impurity, can introduce 
four energy levels into four-valence semiconductor [1]. One 
level is of donor type, while the other three are of acceptor 
type. Fortunately, only a few of them really appear in the 
semiconductor band gap. 


The density of the free carriers, electrons (n) and holes (p), 
depend on the density of the ionised impurities - shallow 
and deep. The shallow impurities are completely ionised at 
room temperature. One part of the deep impurity shall 
behave as a donor N7p, the other as an acceptor Nz, while 


the rest shall be neutral. The concentrations N74 and Nrp 
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depend on the concentrations of both, deep and shallow 
impurities. The charge neutrality equation for a 
semiconductor having shallow donor and acceptor (Np, Na) 


and an amphoteric impurity N7 is expressed as: 


The semiconductor resistivity, p, depends on the density of 
the free carriers (electrons, n and holes, p), and its mobility 


(Up Up) according to 
1 


p= (2) 
q-(n-y +P Hy) 


Comparing the measured and calculated values of the 
semiconductor resistivity, having gold or platinum and 
shallow donors or acceptors, we shall try to examine the 
influence of the type of shallow impurity onto the ionisation 
of a deep impurity. 


II. GOLD AND PLATINUM IN SILICON 


In silicon gold exhibits two, and platinum three deep energy 
levels [2, 3, 6, 9, 10]. Gold introduces one acceptor and one 
donor level; the acceptor level is far away from the 
conduction band for about 0,54 eV, the donor level from the 
valence band for about 0,34 eV. Platinum has one donor 
and two acceptor levels; the donor level has a position at 
0,32 eV above the valence band, the second acceptor level 
is situated 0,23 eV under the conduction band. But these is 
a great uncertainty about the first acceptor level; it is 
positioned at 0,72 eV, 0,49 eV [6], 0,4 eV [9] under the 
conduction band. Moreover in [2, 4] this level is missing. 


The position of deep energy levels introduced by gold and 
platinum in a silicon considered here, according to [2, 3, 6, 
9], are located at: 
gold: platinum: 
Ep =E, + 0,35 eV Ep =E, +0,32eV 
E, =E, —0,54eV E,, =E, —0,4eV 
Exp = Ee aed 0,26 eV 
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Figure I — The resistivity as a function of the density 
of gold and platinum in n-type of silicon 


III. STEADY-STATE STATISTICS OF DEEP IMPURITY 


The steady state statistics for multilevel deep impurities is 
given by Shockly and Last [5, 6]. It assumes that the lower 
level has greater possibility of occupation, and that the 
higher level shall be occupied only if the lower level 
already has an electron. The same result shall be obtained if 
the Fermi-Dirac statistic is used, keeping in mind that the 
occupation of one level depends on the occupation of the 
another one [4, 7]. 


The probability functions used in [4] are the Fermi-Dirac 
probabilities. According to the [2, 3] the entropy factors are 
used to adjust the probability of ionisation to a value that is 
in the best agreement with the measured one; for the 
acceptor level it is the entropy factor for an electron x, and 
for the donor level Xp is the entropy factor for a hole. The 
probabilities are defined as follows: 


fle,)=) 1 ono SF J 
f(E,)=1- f(E,) 5 


=I 
jlEp)=|1+1, -en{ == Fe. 
(4) 


f(Ep)=1- f(Ep) 


In the [2, 3] the influence of gold and platinum on the 
resistivity of n- and p-type of silicon is considered. In these 
articles it has been claimed that, in each examined case only 
one level is important, while the other have no influence. 


The effective ionised concentration of gold can be 
expressed as: 


Nr -Nqp = Nrlf(E,)- F(Ep)) 


while the behaviour of platinum is described by: 


Nia —Nop = Nr [F(En2)+ f(Em)-FlEd)| 


Mostly, at the multilevel deep impurity only two levels 
must be considered; sometimes, only one level shall be 
important. 


IV. EXPERIMENTAL AND CALCULATED RESULTS 


In the experiments described in [2, 3] the n- and p-type 
samples of silicon, both having a resistivity of 16 Qcm are 
used. According to this resistivity shallow impurity is 


estimated to be Np=2,8:10!4 cm™3, and N4=8,6:10!4 cm, 


respectively. Gold or platinum were then introduced and 
their influence on the resistivity observed. 


Fig. 1 presents the measured and calculated values of the 
silicon resistivity, primarily doped with shallow donors 
after which gold or platinum were added. 


As it can be seen, both deep impurities act as-.an acceptor; 
the gold is a stronger one. Such a result was expected, 
because the gold acceptor level is lower than the platinum 
acceptor level. If the density of gold is high enough it will 
convert the n-type of silicon to p-type, as it is indicated in 
Fig. 1. This capability of gold to convert the n-type of 
silicon to p-type has been already known [8]. 


Although platinum also acts as an acceptor, for all the 
observed densities of platinum, the semiconductor remains 
the n-type. In the comparison with gold, the density of the 
platinum atoms ionised as an acceptor is very small. 


It is clear that the donor levels of both deep impurities are 
completely occupied and neutral; the upper acceptor level 
of platinum E,, is empty, so this level is neutral too. The 


only important level is a deep acceptor level E, for gold and 
E,, for platinum. 


p,Qcm 


1000 mee 
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xxxxx Au calculated 
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Figure 2 -The resistivity as a function of the density 
of gold and platinum in p-type of silicon 
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The entropy factors x, of the acceptor level, used in (3) in 
the n-type of silicon are for gold y,=8 [3] and for platinum 
Xn =1000. 


On Fig. 2 the starting material is p-type of silicon. At first 
glance, the influence of gold and platinum might seem the 
same, which can be attributed to the similar positions of 
their deep donor levels. It can be concluded that their 
acceptor levels shall be empty, unionised and shall have no 
influence on the densities of electrons and holes. Although 
the platinum acceptor level E,, is higher than the level E, 


of gold, its influence is evident, as it can be seen from the 
measured data [2,4] given in Fig, 2. It is clearly seen that 
for the greater density of the deep impurity, the platinum 
doped sample has a lower resistivity than the gold one, 
which means that the sample doped with platinum is a 
“stronger” p-type. 

In [3] the calculated values of the entropy factor for the 
gold donor level in the p-type of silicon is obtained as 
Xp=28, while the influence of acceptor level is negligible. 


In [2] considering platinum, the same model is used, only 
one level, Ep, is taken into account and the agreement with 
the measured values is vague. If both levels are taken into 
account, the donor level Ep and the first acceptor level E,/, 
the excellent agreement between the calculated and 
measured data is obtained for the platinum in the p-type of 
silicon, as can be seen in Fig 2. The entropy factors used in 
(3) and (4) are ¥,=8 and x,,=0,0005. 


In all calculations the mobility of electrons and holes are 
calculated as a function of the net density of all the added 
impurities according to [10] and [11]. 


V. DISCUSSION OF THE RESULTS 


The entropy factors used in (3) and (4) adjust the 
probability functions to such values that the best agreement 
with the experimental results will be achieved. 


We can imagine an effective value of acceptor level which 
will be defined as: 


E, = E neg —~Er 
OG a DY i) 
Xn iT } x i. (3) 


An effective value of donor level will be: 
E,-E E,-E 
a : exp =F FD on 2a) (6) 


According to the predicted positions of the deep energy 
levels and the obtained entropy factors, using relation (5) 
and (6), the following positions of the effective levels are 
obtained: 


the gold: 


n-type: E, =Ec- —0,54 eV, An =50 
E neff = Ec —0,44 eV 
p-type: Ep =E,+0,35eV, Xp = 28 


Epeg = E, +0,26eV 


the platinum: 

n-type: E, =E-—0,4eV, XZ, = 1000 
E peg = Ec —0,22 eV = Egy 

p-type: Ep =E,+0,32eV, Lp =25 
Ene = E, +0,24eV 
E, =Ec—-0,4eV, 
Exneg =Ec —0,6eV =E,, 


X, =5:10* 


Having in mind those values of the effective positions of 
the deep energy levels we shall try to imagine how the 
compensation between the shallow and deep impurity goes 
on. 


In the n-type, the donor levels in both gold and platinum, 
are occupied and neutral; the acceptor level of gold E,, and 


the higher acceptor level E,, of platinum are partially filled 


with electrons. The lower acceptor level of platinum is 
empty and neutral. These situations are shown on Fig, 3.a 
and Fig 3.b. 


b) platinum 


Figure 3 — Compensation between the shallow 
donors and deep impurity 
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In the p-type of semiconductors, where the shallow 
acceptors are compensated by the deep impurity, the 
acceptor level of gold E,, and higher acceptor level of 
platinum E,, are empty. The deep impurity shall send a few 
electrons from its donor level Ep to the shallow acceptor 
level. We can see that the platinum has a few electrons in its 
acceptor level. These electrons can be captured during the 
generation and recombination process. These situations are 
shown on Fig. 4a and Fig 4b. 
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a) gold 


b) platinum 


Figure 4 — Compensation between the shallow 
acceptors and deep impurity 


CONCLUSION 


Considering the n-type of silicon we can see that the donor 
level of both deep impurities has no influence on the 
concentrations of free carriers. Because of its position this 
level is completely occupied by electrons and neutral. The 
shallow donors are sending their fifth electrons in the 
conduction band and in the deep acceptor level. In the case 
of platinum the occupation of acceptor level is so small that 
it looks like the level E,, which is the higher one is 
partially ionised, while the first acceptor level E,, is empty 
and neutral. 

In the p-type of silicon the shallow acceptor level takes the 
electrons from the valence band and from the deep donor 
level. In the case of gold the acceptor level is empty and 
neutral. In the case of platinum the first acceptor level Ex) 


captures a few electrons created in the generation and 


recombination processes. The second acceptor level is 
empty and can be ignored. 

It can be concluded that in the case of multilevel deep 
impurity it is hard to say in advance which level(s) shall 
participate in the compensation with shallow impurity. It 
seems that the most important deep level is closest to the 
shallow one. Sometimes it is only one level which is 
important; this is the case with gold in n-and p-type of 
silicon, and with platinum in the n-type. But sometimes two 
levels must be considered, as we have in a case of platinum 
in the p- type of silicon. 


The great differences of the entropy values for the acceptor 
level E, of platinum, perhaps explain the different positions 


of this level, which can be found in literature [2, 4, 6, 9]. 
May it be that the platinum in the n-type and p-type of 
silicon exhibits different numbers and different positions of 
deep energy levels? The level which is completely 
unionised and neutral has no influence; we may say that 
such level does not exist. So, may we conclude that the gold 
in n-type of silicon has only an acceptor level, while in the 
p-type it has only a donor level? Perhaps it depends not 
only on the type of a shallow impurity, but on its density 
too. In order to find out the real situation, many more tests 
must be carried out, using different densities of shallow 
donor and acceptor, at various temperatures. 
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Series- Voltage Noise Margin of CMOS * 


Aleksandar Szabo, Member, IEEE, and Zeljko Butkovié, Member, IEEE 


Abstract—The concept of noise margin is very important in the 
design and application of digital logic circuits. Noise margin is 
the maximum spurious signal that can be accepted by the 
device when used in a system, whilst still operating correctly. 
In this work the methods of determining the static and 
dynamic series-voltage noise margins and the obtained results 
for CMOS are presented. 


I. INTRODUCTION 


Noise margin is the maximum spurious signal that can be 
accepted by the device when used in a system whilst still 
operating correctly [1]. Static noise is a disturbance which 
is changing very slowly. Its effect is depending on its 
amplitude and not on its duration. The influence of dynamic 
noise is a function of its amplitude and duration. Four basic 
types of noise sources exist: series-voltage noise, 
parallel-current noise, voltage noise at the ground line and 
voltage noise in the power-supply line [2]. 


Noise can appear in one point or in more points in a digital 
system. Best-case static noise margin is the magnitude of 
maximum noise concentrated somewhere in one point in an 
infinitely long chain of identical gates that does not disturb 
the proper logic operation of the gates. Worst-case noise is 
defined as a disturbance which is present in all logic gates 
in an infinitely long chain of gates [3]. In such a case, 
instead of an infinitely long chain, a flip-flop configuration 
is used (Fig.1). Noise margin is the signal which brings the 
flip-flop to the edge of switching to the wrong state [1], [2]. 
In this paper the determination of the worst case 
series-voltage noise margin for CMOS is presented. 


Fig. 1. Flip-flop with noise voltages. 


The authors are with the University of Zagreb, Faculty of Electrical 
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II. NOISE MARGIN DETERMINATION METHODS 


In the literature various criteria have been formulated for 
static noise margin determination [1]-[4]. In most cases the 
starting point is the transfer characteristic of the basic 
inverter of the logic family under consideration. In this 
paper the basic CMOS inverter (Fig. 2) will be considered. 


Vs5 
Ipp | D 
G ie 


Fig.2. CMOS inverter. 


The unity slope method is often used, but this method 
mostly does not give correct solutions. The determination 
of the noise margin of ECL is a good example for this. The 
statement that the unity slope method and the maximum 
square method in the case of symmetric transfer 
characteristics give same results is not true [1], [6], [7]. So 
this method will not be used here. 


Using the maximum square method the worst-case static 
series-voltage noise margin can be found geometrically by 
considering the maximum possible square between the 
normal and mirrored transfer characteristic [1], but only 
when R,,, <<R;,, because in this case the shape of the 
voltage transfer characteristic is invariant with respect to 
loading [3]. Fortunately, this condition is fulfilled for 
CMOS. 


The flip-flop method gives always the correct solution for 
the static series-voltage noise margin. The voltage Via, 
(Fig. 1) should be raised slowly starting from zero until the 
value at which the flip-flop changes its state. The flip-flop 
method can also be successfully used for determining 
dynamic noise margins by changing the amplitude and 
duration of pulses applied to the flip-flop. 
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Ill. STATIC SERIES-VOLTAGE NOISE MARGIN 
DETERMINATION 


Using the MOSFET current equations [5] the transfer 

characteristic of the CMOS inverter (Fig. 2) can be 

calculated. 

For the n-channel device in the triode region 

Tpn = Kn [2 Vos Ven) Vs -V3s], 9< Vos $Vos —Ven (1) 

and in the saturation region 
I Dn = Kn Ves ie ee 


The constant k,, is given by 


0<Voes —VinSVps. (2) 


— Hn Fox W 
pay hot 


n > 


where yu, is the electron mobility in the channel, ¢,, is the 


dielectric constant of the oxide insulating layer, /,, is the 
thickness of the oxide under the gate, and W and L are 
channel width and length, respectively. 


For the p-channel device the current equations are more 
conveniently written in the following form, for the triode 
region 


Tpp =kp [2 Use oS Vr )Vsp = >I, O<Vgp <Vsg+Vrp (3) 
and for the saturation region 


Isp =k, Vse +Vrp), 


The constant k, for the p-channel device is defined as for 


0<V5g +Vr <V op. (4) 


the n-channel device by changing the electron mobility 
with the hole mobility 4, . 


The currents through both transisiors in the CMOS inverter 
are equal. When 7,, is in the saturation region and Tin 


Al T,, in saturation 
T,, off T,, in triode region 


Ue in saturation 
T,, in triode region 


ie off 


0 1 2 3 4 5 VV 


i 


Fig.3. Analytically calculated transfer characteristic, 


the triode region 
k, v, =e Vin) = 

=k, b Wss —Vi +Vrp) Vss —Vo)-Vss -Voy 
taking into account that 


Vsg =Vss -V,; and Vsp =Vss -Vy. 


(5) 


When 7, is in the saturation region and 7, in the triode 
region 
ky Vss -Vi + Vip = kn 2 Vi - Vin) Yo -V2]. ©) 


When both transistors are in the saturation region 


kn Vi —Ven¥ =k Vss —Vi + Vip)’ () 


where from 


V=V kyl kn Vss 4+Vap) tia (8) 
1 1sat l r Vkp kn 9 
In the transfer characteristics (Fig. 3) the abrupt transition is 
given by 
AV, =Vrp = Vp. (9) 


For real CMOS inverters this transition is sharp, but it has a 
finite slope because the drain current is not independent of 
the drain to source voltage. 


For example, if we take k, =k, and V;,=-V7,=1V, 
equation (5) and (6) are simplified into 


V; -V, +Vss -1)P =(2Vs5 - 4) Vss -V-), (10) 


V. -Vi +Vss -1)° =(2Vs5 - 4)Vo. (11) 
Differentiating (10) 
OV, a f/av, 
OU ee ole 


and putting 0V,/0V,;=-1, we get the parabola axis 
equation 
V, =V, + a (12) 


Taking Vss =5V, from (12) and (10) we can find the 
coordinates of the point A (Fig. 4): 


A(aery, 
8 8 
or V; =2.125V and V, =4.375V. 


The coordinates of the point B can be found calculating the 
intersection of the line (12) and the mirrored transfer 
characteristic 


Vj -v, +4P =6%,. (13) 
The coordinates are 


“qc a eee 
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Output voltage (V) 


() 1 2 3 4 
Input voltage (V) 


in 


Fig.4. Maximum square method applied to analytically 
calculated transfer characteristic. 


eal 
8 8 


or V,;=0.375V and V,=2.875V. We now have the 


diagonal of the maximum square between the normal and 
mirrored transfer characteristic. The noise margin given by 
the size of the maximum square is easily calculated 


PSPICE analysis enables more accurate modeling of the 
transfer characteristic for a real CMOS process. Our 
calculation is done with the MOSFET parameters from [8], 
using the Level 3 model. The threshold voltages are 
Vin =—Vrp =1V, and coefficients k, =0.2mA/V? and 


ky = 0.24 mA/V2. As these models describe the channel 


length shortening in the saturation region the calculated 
transfer characteristic, shown in Fig. 5, has the finite slope 
in the region of its maximal variation. Although based on 


Output voltage (V) 


ie) 1 2 ib) 4 


wn 


Input voltage (V) 


Fig.5. Maximum square method applied to transfer 
characteristic calculated using PSPICE. 


the real process, the obtained results from Fig 5. differ only 
slightly from the previously shown analytical solutions. The 
size of the maximum square gives 


NM =1.80V. 


The same result is obtained with PSPICE analysis making 
use of the flip-flop method. The flip-flop method gives 
always the correct solution [9], [10]. In the flip-flop the 
series noise voltages Vj,, from Fig. 1 are changed slowly, 
starting from zero. At the moment when the voltage reaches 
the noise margin the flip-flop changes its state (Fig. 6). 


Voltage (V) 


0.0 0.2 0.4 0.6 0.8 1.C 
Time (s) 


Fig.6. Flip-flop output voltages change when 
the noise margin is reached. 


IV. DYNAMIC SERIES-VOLTAGE NOISE MARGIN 
DETERMINATION 


To get the dynamic series-voltage noise margin the flip-flop 
method and PSPICE are used. Calculations of the dynamic 
noise margin as a function of the rectangular noise voltage 
pulse width and the capacitive load as a parameter are 
carried out (Fig. 7). 


Dynamic noise margin (V) 


Pulse width (ns) 


Fig.7. Dynamic noise margin 
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V. CONCLUSION 


The manufacturers define the static noise margin as the 
difference between worst-case logic levels at the output and 
the input of logic gates, separately for the low state, and for 
the high state. In manufacturers’ specifications the 
guaranteed minima of the noise margins, taking into 
account all tolerances, are given. These noise margins for 
CMOS are 30% of the supply voltage. For Vss =5 V_ this 
would be 1.5 V. The obtained results are nominal values 
without tolerances and are a little bit higher (36% of Vs ). 


The use of correct methods for the determination of the 
noise margin with nominal values can be used for accurate 
comparisons of noise margin capabilities of various logic 
gates and different logic families. 
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Simulation of Large-Signal Behavior 
of a GaAs Low Noise Amplifier * 


Adrijan Bari¢é, Member, IEEE, and Zeljko Butkovi¢, Member, IEEE 


Abstract— This paper investigates the linearity of a GaAs 
MESFET amplifier. A two-stage low noise amplifier is 
analyzed by PSPICE. The large-signal S parameters are 
determined for different load conditions, ie. a resistive load, 
an active self-biased load and a cascode configuration. 
Additionally, the linearity is described in terms of two-tone 
intermodulation products of the 3'¢ and 5™ order, as well as 
the 3° order intercept point (JP, ). 


I. INTRODUCTION 


The linearity is of primary importance in wireless 
communication systems because of stringent regulatory 
emission requirements [1]. This paper investigates the 
linearity on the example of a two-stage intermediate 
frequency low noise amplifier. The analysis is performed 
by PSPICE [2]. The first stage is a common-source 
amplifier, which ensures necessary gain, and the second 
stage is a common-drain amplifier that ensures output 
match in a wide frequency range. The basic amplifier with a 
pure resistive load Rp is shown in Figure 1. 


Firstly, an S parameter simulation setup is described and the 
results for S parameters are given as a function of the 
available input power. The S parameters are simulated for 
three different load conditions. Besides a pure resistive 
load, an active self-biased load and a cascode configuration 
is used. These configurations are shown in Figure 2. 


Additionally, the intermodulation products of the 3“ and 5" 
order are calculated for the amplifier with a resistive load at 
100 MHz for a wide range of input power levels. Finally, 


Fig. 1. Low noise amplifier with a resistive load. 


The authors are with the University of Zagreb, Faculty of Electrical 
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Fig.2. Different loads: (a) self-biased, (b) cascode. 


the output intercept point is calculated at 100 MHz for all 
three types of load. 


II. SSPARAMETERS 


The S parameters are defined with the set of equations 
b, = Si Q + S15 Qe (1) 


by = S3, a + Sy a2, (2) 


where a, and a» are the incident and 6, and 6, are the 
reflected waves at the input port 1 and output port 2. The 
waves are related to the voltages and currents at the same 
ports by [3] 


uy + Zo i u, —Zo 
a, = ———_., b, =————_., (3) 
RRR EIA 
42 +Zoh Uz — Zo bo (4) 


Pages eins Ail Bae eit Cora 
oon F- em anya 


For most applications the characteristic impedance of the 
system Z,y =5S0Q. 


The parameters S,,; and S5, are extracted using the circuit 
shown in Figure 3a [4], with two identical source voltages 
u,. The output port is terminated with the characteristic 
impedance Zy) so the incident wave a,=0. From this 
circuit . 
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a fen (8) The parameters S,, and S,. are extracted in a similar way 
by using the circuit in Figure 3b 


It is clear that the parameters S,, and S,, are calculated as 
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Figure 4 presents the magnitudes of S,,, S2,, Sj, and 
Sy obtained when the simulations are performed using a 
pure resistive load. 


The simulations performed by using an active self-biased 
and cascode load show very similar behavior, except for the 
forward transmission S,, that is larger in the case of the 
cascode load at lower frequencies, as expected. However, 
the gain of the cascode load actually falls below the value 
of the gain obtained for the pure resistive load at higher 
frequencies, which was quite unexpected as the cutoff 
frequency of the simulated MESFETs is around 20 GHz. 
Figure 5 shows the magnitude of S$, for resistive, active 
and cascode load as a function of frequency for the input 
levels of -56 dBm and -36 dBm. 


resistive load 
actvie load 
cascode load 


— 56 dBm 
-— 36 dBm 


7 


o--0-.__*s 


Frequency (MHz) 


Fig.5. Magnitudes of S,, parameter for different loads 
at two input levels. 


III. INTERMODULATION PRODUCTS 


For the sake of completeness, the intermodulation products, 
(see e.g. [5]), are calculated for the case of a pure resistive 
load using the input signals at 100 and 110 MHz. The 
power of the fundamental component shows the expected 
increase of 1 dB when the input power is increased | dB, at 
low input power levels, while the /M3 product shows the 
expected slope of 3 dB (see Table I and Fig. 6). 


Calculating from the results in Table I, the output intercept 
point of the 3" order (OJP, ) is -1.35 dBm (resistive load). 
In the case of the active load OJP; = 0.1dBm, and OIP; is 
+0.5 dBm for the cascode load. These results are obtained 
by using the single tone frequencies of 100 and 110 MHz, 
thus, higher O/P, values result from a higher gain (i.e. $5, ) 


of active and cascode load. However, that would change at 
higher frequencies, which can be concluded from the results 


for |.S>, | shown in Fig. 5. 
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Fig.6. Output power of the fundamental component 
and JM, product. 


Table I Fundamental component and intermodulation products. 


Available input power - Fundamental component - 
single tone (dBm) 110 MHz (dBm) 


IM, product - IM. product - 
120 MHz (dBm) 130 MHz (dBm) 
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IV. CONCLUSION 


The linearity of a low noise intermediate frequency GaAs 
amplifier is investigated by calculating the large-signal S 
parameters, for the resistive, self-biased active and cascode 
load in the frequency range from 50 MHz to 500 MHz. The 
S parameters are found to behave similarly, except for the 
forward transmission S>, that is larger for active and 
cascode load at lower frequencies, and unexpectedly lower 
at higher frequencies. Additionally, the JM; and /M; 
products are simulated for the amplifier with the resistive 
load at 100 MHz. Furthermore, the output intercept point is 
calculated for all three types of load and it is found that the 
results conform to the simulated large-signal S parameters. 
It is worth noting that the amplitude and phase 
characteristics of S,,; may be used to obtain AM-to-AM 
and AM-to-PM characteristics used in behavioral modeling 
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of the systems that use wideband modulated signals, e.g. 
CDMA [6]. 
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CAD Modeling of MIC Multiport Power 
Combiners/Dividers Fabricated on the Base of 
Extended Concept of Hybrid Ring Structure 


Abdulgader Z. Abdalla’, Bogdan J. Janiczak”, and Jerzy Mazur’ 


Abstract— The results of theoretical and experimental studies 
of symmetrical power dividers in integrated microstrip-line 
based planar technology are shown and concluded. Presented 
work focuses on the application of well-known hybrid power 
combiner/divider. Modified version of the original structure is 
proposed to achieve enhanced performance of in-phase power 
divider around mid-band frequency for classical 3-dB 
structures. The concept is then further extended on the 
modeling of different multiport combiners/dividers in either 
in- or anti-phase mode of excitation. The achievements of this 
study demonstrate the accuracy of applied simulation tools as 
well as potential utility of analyzed structures for the 
construction of quality integrated planar integrated circuits 
such as balanced components and multi-element feeders. 


Index Terms—MIC Dividers and Couplers. 


I. INTRODUCTION 


In almost every high frequency system application, 
including dynamically § expanding mass __ scale 
telecommunication market, user can easily recognize 
components responsible for power dividing or combining. 
These components of high frequency circuitry are used 
either to monitor the state of the system or to provide 
multichannel operation. Dependently on system 
requirements both symmetrical and asymmetrical power 
division options can be met in practice. One of the key 
elements appearing in these applications is usually a hybrid 
3dB power combiner/divider allowing two way 
transmission of the same power level signals [1]. 


Equal magnitude power dividers in integrated technology 
can be performed by use of only few circuit elements 
including branch line couplers, proximity couplers, 
Wilkinson type power dividers or hybrid ring structures [1], 
[2]. Whereas branch line couplers are usually narrow band 
components, unbeatable by means of available frequency 
operation range 3-dB proximity couplers are usually 
extremely difficult for realization [2], [3]. In many cases, 
thus designers of high frequency circuits and systems are 


' Higher Institute of Electronic Eng., PO Box: 38645, Beni-Walid, Libya. 
2 Dept. of Microwave Engineering & Optical Telecommunication, 
Technical University of Gdansk, Narutowicza 11/12, 80-952 Gdansk. 


left with two major classes of power combining/dividing 
elements, i.e. Wilkinson-type and hybrid-ring structures. 
The latter comparing with Wilkinson-type power divider do 
not require additional lumped elements (resistors) and their 
useful operation band is approximately twice as much as 
that of branch line structures. The interest in these types of 
components is still observed in scientific literature [4], [5] 
due to very practical reasons. 


In this paper, therefore, an approach to the modeling of both 
conventional 3-dB and multiport power combiners/dividers 
based on the application and extension of classical hybrid- 
ring components fabricated in integrated microstrip line 
technology will be described and illustrated whenever 
possible by representative numerical and empirical data. 
The early achievements presented in [6] has been further 
expanded to prove the possibility of efficient design of 
multiport components based on rather topologically simple 
structure. 


II. OUTLINE OF THE MODELING APROACH 


Two structures are analyzed in this part of the paper with 
the help of a commercial simulator [7] that allow equal 
magnitude 3-dB power split of high frequency signals 
namely conventional and modified hybrid-ring dividers, 
shown in Fig.la and Fig.1b, respectively. The modified 
structures has only three ports and in two points is 
symmetrically loaded with terminating impedances Z,=50Q 
thus resembling, to certain degree, technique known from 
Wilkinson-type power dividers. 


L1, W1 | L2, W2 eisai wi pe w2 


Ti | = L3, W3 
L3, W3 
(a) 


Fig. 1 Simplified topological view of conventional (a) and modified (6) 
hybrid-ring based power combiner/divider. 
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IV. MULTIPORT COMPONENTS 


The results presented in the previous sections have clearly 
demonstrated not only the rationality of the proposed 
approach, but also quite visible superiority of the 
symmetrically loaded hybrid ring structure over its 
predecessor in case of in-phase excitation regime. In many 
cases the multiport circuit is constructed as a both uni- and 
bi-directional combination of the base structure (Wilkinson 
type power divider, branch, hybrid-ring or proximity 
couplers). This approach leads to so called tree-like or 
corporate structures. 


If only branch line, hybrid-ring, or proximity couplers are 
used the designer will certainly meet the problem of 
excessive number of ports that has to be loaded by 
terminating resistors. The number of additional loads, 
usually due to required quality of design, which are made in 
coaxial-line technology will match the number of couplers 
used in the system. In case of Wilkinson-type multiport 
power dividers designer is usually limited by technology. 
The extension of conventional Wilkinson structure to 
multiport one will result in technological problems due to 
high impedance line restrictions (width of the strip). 


Here we propose a practical solution to the design of 
multiport power combiner/divider based on the topological 
extension of previously discussed conventional and 
modified (symmetrically loaded) hybrid-ring structure. 
Exemplary topology of the six-port and modified seven- 
port hybrid-ring based solutions, being the first in the row 
of practical multiport arrangements are shown in Fig. 5-a 
and -b, respectively. 


a yaercitieaeer 
BJ: 


6 a a 
@ (b) 


Fig. 5 Topology of a six-port (a) and a seven-port (b) extended hybrid 
based power combiner/divider. 


Of course for the constant 3-dB power increment between 
consecutive outputs the line dimension of particular 
sections are exactly the same conserving thus uniformity of 
the ring. In case of variable power increment between 
adjusting outputs, line parameters (characteristic 
impedances) have to be modified accordingly to system 
requirements. This obviously slightly increases number of 
parameters to be optimized by numerical routines but, 
generally the structure is still possible to be treated 
analytically by the concept of in-phase and anti-phase 
excitations as long as vertical symmetry plane exists. Such 


an analysis, however, usually ignores the effects of 
discontinuities so, from practical point of view, it is safer to 
rely on conventional CAD tools [7]. 


The following section will briefly present the theoretical 
and experimental results of structures shown in Fig. 5 and 
fabricated on RT Duroid™ substrate laminate of the 
thickness 4=1.27 mm and electric permittivity ¢=10.2. The 
line width of the central ring equals 1.18mm and matches 
that of the feeding line. The smallest microstrip section 
length Z=27mm. Due to the symmetry of the structure the 
expected output level is -6dB. Exemplary representative 
results of numerical simulations and measurements are 
shown in Figs. 6-10. Figs. 6-9 refer to asymmetrical 6-port 
hybrid-ring structure while Fig.10 refers to the 7-port 
symmetrically loaded ring of Fig. 5-b, respectively. 


.95 0.97 0.99 1.01 1.03 1.05 1.07 1.09 wn 1.13 1.15 

Fig. 6 Theoretical (t) and experimental (x) reflection and transmission 
characteristics for the 2" and 3 ports of 6-dB six-port hybrid-ring 
coupler with in-phase excitation. 


Fig. 7 Theoretical (t) and experimental (x) reflection and transmission 
characteristics for the 2? and 3™ ports of 6-dB six-port hybrid-ring 
coupler with anti-phase excitation. 
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Fig. 8 Experimental differential in/anti-phase shift for 2™ and 3™ ports of 
6-dB six-port hybrid-ring coupler. 
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Fig. 9 Theoretical (t) and experimental (x) reflection characteristics of 6- 
dB six-port hybrid nng coupler. 


6( 46) 


Sei 


Fis. 10 Simulsted reflection and transmissim characteristics of 6B 
seven-port hybrid nng combiner divider. 


Very good power balance between output ports around mid- 
band frequency has been achieved in cases of both in-phase 
excitation. Fig 6, and anti-phase excitation. Fig 7. of the 
extended version of the hybrid ring coupler. It is also worth 
mentioning that the reflection characteristics at different 
ports, especially at the outputs. of the structure are also 
below the levels accepted from practical point of view in 
professional solutions. 


The behavior of the diferential phase shift between outputs 
situated at different side branches of the hybrid ring shown 
in Fig. 8 follows theoretical predictions. However. in case 
of anti-phase excitation the frequency dependence becomes 
visible. Fig. 9 demonstrates frequency response of the 
reflection coefficients simulated by means of CAD tools 
and measured, with the inclusion of removable SMA-to 
microstrip transition, in terms of the network analyzer. 


Though the simulations and experiments have been 
performed in relatively low frequency region, it should be 
pointed out that qualitative behavior of the structure will 
conserve properties reported in this paper and, dependently 
on the combination of the frequency and substrate 
parameters, more significant dispersion may occurs in some 
cases. 


Fig. 10 _ presents finally theoretical selected reflection and 
transmission coefficients of symmetrically loaded extended 
hybrid ring structure. No experimental data are reported for 
this structure. The obtained numerical results demonstrate 
well balance power split at different outputs and reasonably 
correct reflection response. The structure of Fig. 5-b can 


only be used for in-phase excitation of the output associated 
with opposite side branches, not necessary for equal 
magnitude power division 


V_ CONCLUSION 


The results of the modeling of electrical behavior of 
conventional and modified 3-dB hybrid nng based power 
combiner/divider fabricated in integrated microsimip Ime 
based technology have been presented and concluded 
Analytical results have been verified experimentally for 
circuits manufactured on RT-Duroid® substrate material of 
high electric permittivity (¢=10.2). Modified version of 
conventional hybrid-ring power combiner/divider has been 
proved superior to original structure by means of broadband 
operation. Additionally, multiport structures with 
symmetrical and asymmetrical loadings have been proposed 
as a natural extension of original 3dB components. These 
structures enable practical realization of multiport power 
combiners/dividers without the need of multiple base 
elements. Very good agreement between theoretical 
predictions and experiments has been found These 
structures, in author's belief, can successfully compete in 
many design problems with conventional solutions based 
on tree-like or corporate arrangements. Moreover, different 
power levels at various outputs can be obtained by proper 
selection of the transmission line section impedance. 
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Simulation of ATM Network Behavior with 
the Event Driven ATM Simulator 


Mladen Kos, Member, IEEE and Zdenko Vrdoljak 


Abstract — We describe ATM Simulator (AS), a simulation- 
based traffic analysis tool for ATM networks and related 
protocols. Concentrating on basic problems of acceleration of 
an event driven simulator, we explain our solutions for 
realization of specific simulator features and present several 
useful implementation tips and tricks. We bring solutions for 
efficient modeling of CBR (Constant Bit Rate) and VBR 
(Variable Bit Rate) traffic generators and network 
components. In the end we present few simulation results in 
order to the simulator performance and future development 
directions. 


I. INTRODUCTION 


Rapid growth of broadband service demands on the market 
initiates rapid development of networks offering 
transmission at high data rates, primarily of ATM networks. 
High prices of new broadband services and network 
equipment imply careful design of new broadband 
networks. Network design tools play the main role in 
network design process, due to the price and performance 
optimization. However, heuristic optimization methods 
often take advantage of statistical traffic models that often 
neglect some important traffic parameters. This fact brings 
us in position to verify traffic performance of optimized 
network configurations without expense of building real 
networks. In case of eventual traffic bottlenecks, we 
introduce corrections into initially optimized network 
configuration. One could say that simulation is more than 
solely verification tool. It is an important constituent of 
network design process. 


Simulation-based network design and analysis has become 
the norm for today's telecommunications technology. 
Network planners need suitable and reliable network design 
tools. Unfortunately, it is very hard to find such suitable 
tools because of the wide variety of problems network 
planners are trying to solve. No simulation-based design 
tool supports all design issues. That's why network planners 
have to dispose of certain simulation technique knowledge 
and a reliable and upgradeable simulator basis in order to 
simulate specific traffic generators, network components 


and protocols, initially not implemented by the simulator 
basis. 


The authors are with the Department of Telecommunications, Faculty of 
Electrical Engineering and Computing, University of Zagreb, Croatia 
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Purpose of ATM Simulator (further in text AS) we present 
in this paper is to give network planner simulation basis that 
can be upgraded by new components as easy as possible. 
Simulation technique chosen for this simulator is known as 
event driven simulation technique and it has been chosen 
among others due to its simplicity and reliability. Object- 
oriented programming alleviates upgrading and widely 
spread operating system helps publicity of this simulator. 
Programming language C++ has been chosen because of its 
popularity among telecommunication engineers. 


The next section explains event-driven simulation technique 
and basic program components needed for its operation. 
Section 3 deals with CBR traffic and VBR traffic modeling 
and simulation. In section 4 we explain simulation model of 
basic network components: BTEs (Broadband Terminal 
Equipment), links and switches. In section 5 we present and 
comment some simulation results in order to demonstrate 
and verify simulator performance. 


II. EVENT DRIVEN SIMULATION TECHNIQUE 


Event driven simulation of a system is based on the idea to 
assign time of occurrence to every relevant event in the 
system, in this case in ATM network. Examples of relevant 
events are arrival of a cell in switch buffer, arrival of a cell 
in destination port or cell discarding from a switch buffer, 
etc. Relations between events and time are products of 
event driven simulation and we use them to measure traffic 
parameters. Events are represented with programming 
objects containing timestamp of occurrence, what 
component sends and receives them, event type and other 
necessary variables. Network components are represented 
with programming objects too. Together with application 
components, they are basic building blocks of the simulator. 
ATM and other protocols are implemented inside of those 
objects. Components send each other events in order to 
communicate and send cells through the network. 


Consider two network components: switch and link. Link 
owns a cell it wants to proceed to the switch with delay At. 
It issues and sends event to the switch with timestamp At + 
global simulation time. When switch receives an event, it 
takes cell from the link, stores cell to buffer and sets global 
time to value of timestamp piggybacked in the received 
event. This is example of a regular event. In case that a 
component sends event to itself, we deal with private event. 
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Event driven simulator consists of an event queue, which 
keeps events sorted by time, arbitrary number of 
components exchanging events and event manager (Fig. 1). 


ae 
1 2 n 


Fig. 1. Event Driven Simulator Machine 


Event manager runs simulation in the way that it takes first 
("oldest") event from the queue and proceeds it to the 
corresponding component object. When component object 
receives event, it performs certain actions and in the end 
sends one or more events to event queue. Global simulation 
time is always set to the value of the received event's 
timestamp. After component is finished, event manager 
takes program control again and the process continues. 


Before simulation starts, event manager creates and 
configures components according to network configuration 
file and initiates them. A number of components send their 
first private and regular events to event queue during 
initiation phase, so that-event manager has events to start 
with. After initiation, event manager sends its own event for 
simulation termination and starts simulation by setting 
global time to zero value and taking and proceeding first 
event. When it reaches termination event, it stops 
simulation and frees memory occupied by components, 
events and component products (objects representing cells 
or other packets that were present in the network at the 
moment of termination). 


The basic problem with any simulator is improving its 
speed. Simulator designer has to pay attention to 
components' complexity and complexity of event queue. In 
case of AS, we have decided to implement event queue as a 
heap with somewhat improved heapsort algorithm. 
Improvement is achieved in algorithm for inserting new 
event into the heap. 


Fig. 2. Heapsort algorithm acceleration trick 
The trick is in finding the route to follow in the heap tree 
before inserting a new member. 


The route is easily found by converting number of elements 
in heap + 1 into binary representation and by cutting the 


most significant bit 1 together with all "higher" bits. For 
instance, if number of elements in heap is 12, 12+1 = 13 = 
...0 110 1b. After cutting remains 1 0 1 b. If we consider 
1 as "right" and 0 as "left", we have the route (Fig.2). This 
significantly simplifies and accelerates heapsort making it 
as fast as quicksort algorithm. 


Acceleration of specific components is subject of next two 
sections. 


Ill. ATM APPLICATIONS 


Application and network component objects are the basic 
building blocks of the simulator. This concept has been 
taken from NIST ATM/HFC simulator described in [1]. 
Applications play the important role of traffic generators. 
There is a large number of traffic generator models. We 
only explain the way they can be implemented in event 
driven simulator. 


CBR application is the easiest to implement. Cell rate is 
used to calculate period of cell repetition AT. When 
initiated, application sends its first private event to event 
queue with timestamp set to value of time when application 
starts generating cells. When application receives its private 
event, it sets global time to value of event's timestamp, 
generates new cell object and sends regular event directed 
to BTE - network component that simulates broadband 
terminal equipment. Regular event contains pointer to cell 
object just created. Its timestamp is set to somewhat higher 
value than global time, thus simulating delay. In the end, 
application sends private event with timestamp set to value 
of AT + global time and gives program control to event 
manager. This process continues until application decides to 
stop sending cells. 


As mentioned before, event is an object. When a component 
receives private event, it is advised not to delete it if 
application, or any other component, prepares to send new 
private event. Creating and deleting objects takes a lot of 
processing time. It is better to send the same event object 
with changed timestamp or other variables if needed. It is 
also advisable to create a dead object repository. Notion 
dead object designates object that should have been deleted 
but kept for future use. Reaching objects from repository, if 
there are any, additionally accelerates simulation. 


Described CBR generator has the most simple model. 
Simulation necessities will ask for more complex models, 
for instance model where cell rate changes in time. In this 
case, application has to send additional private event in 
initiation phase. When application receives this event, it has 
to determine new cell rate, recalculate cell repeat period AT 
and send the same private event shifted in time for a 
specified period of time. Additional private events could be 
used to simulate call establishment and termination. 


AS implements CBR application that emulates traffic 
generated by generally n CBR traffic generators. First idea 
was to realize this application by multiplexing traffic from 
equal CBR applications. However, if m is a large number, 
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memory occupied for m application objects could bring 
simulator to point at which operating system does a lot of 
Swapping on computer's hard disk. This significantly 
decreases simulator speed. Generally, simulator designer 
has to use as less memory for simulation as possible. This is 
why AS uses a single application only for generating traffic 
of all m sources. Trick consists of maintaining variables 
(cell rate, mean call duration. ...) for all n sources inside of 
a Single application object. Events have to piggyback id that 
corresponds to traffic source they belong to. id will be used 
as the reference to what variables set to use in order to 
process event arrival. 


Simulation of VBR traffic is similar to CBR traffic 
simulation. PCR (peak cell rate) and SCR (sustainable cell 
Tate) are used to calculate different parameters needed for 
traffic synthesizing. The simplest VBR traffic generator is 
Poisson ON/OFF traffic generator. Its application maintains 
variable state. If state is 0, no cells are generated and sent. 
If state is 1, cells are generated at speed of PCR. 
Mechanism is the same as of simple CBR application. The 
second private event is needed to change state variable. 
When it arrives, state simply changes from 0 to 1 or from 1 
to 0. If state is 0 and the first private event arrives. 
application simply throws it to event repository thus 
terminating cell flow process. When state changes from 0 to 
1 a new cell and corresponding regular and private events 
are generated and sent thus continuing the process. The 
following relation has to exist between parameters: 


SCR = PCR. JON. 


Togs 


where Tp, designates mean ON period duration and To;; 
mean OFF period duration. 


One of the most important VBR traffic generators is 
self-similar VBR generator. It is often used as ABR traffic 
generator too. Its importance Hes im the fact that network 
airival processes are much better modeled by statistically 
self-similar processes than with Poisson processes [2]. VBR 
video traffic is one of the examples [3]. Basic problem with 
generation of self-similar traffic is fast generator of self- 
similar process samples. AS took advantage of fast Fourier 
transform method for synthesizing approximate self-similar 
sample paths by V. Paxton [4]. Paxton’s algorithm is based 
on synthesizing process sample paths that have the same 
Power spectrum as self-similar process known as fractional 
Gaussian noise. AS generates self-similar traffic by 
introducing fixed time periods called bins. Every bin is 
assigned a fixed cell rate calculated out of linear transform 
of a corresponding sample path element. This implies that 
simulator has to synthesize sample path(s) and calculate bit 
rates for every bin before simulation starts. Cells are 
generated the same way as with simple CBR application, 


by self-similar VBR application, new cell Tate is taken and 
new cell repeat period AT is calculated. Bin duration is a 
variable set by the user. 


IV. NETWORK COMPONENTS 


AS implements three network components: BTE 
(Broadband Terminal Equipment), switch and link. BTE is 
the simplest component. It implements a priority buffer that 
collects cells coming from applications and proceeds them 
to link. In opposite direction it demultiplexes cell flow by 
sending received cells to corresponding applicati 
without previous buffering. Priority buffer is the same as 
the one used in switches. CBR and VBR cells have the 
same priority, but rt-VBR cells have higher priority than 
nt-VBR cells. ABR cells have lower and UBR cells have 
the lowest priority. 


Simulator speed largely depends on link component 
implementation. Link emulates transmission system 
between two switches or switch and BTE. Observed in 
time. cell flow very often reminds on Cantor set with 
exchanging burst and idle periods. Taking this into account, 
we conclude that if the load on link is small, simulator will 
spend a huge amount of time on transmitting idle cells. Idle 
cells are not of interest for simulation. It's better not to 
simulate them if idle period is relatively long. Link owns 
private event that repeats every cell slot repeat time. 
Receiving and resending this event is what significantly 
slows down simulation. That's why link implemented in AS 
closely cooperates with buffer it takes cells from, in order to 
determine idle periods. When there are no cells to transmit 
in buffer, buffer informs link to stop sending private events. 
Link thus does not transmit idle cells. After buffer receives 
cell from the application or switching element it informs 
link that burst period started and link restores its private 
event thus continuing sending cells. 


AS implements three switch types, with input, output and 
central buffering strategies (for instance [5]. Switch 
element has bus topology and it serves all ports at speed of - 
the fastest port. This method is not effective in case where 
there are very fast and very slow ports. Situation could be 
improved by serving more cells from faster ports in a single 
tact of the bus, but one encounters cell priority problem. 
This option if for further study. 

Significant performance improvement can be achieved in 
case of central queuing switch. In reality, central queuing 
Switch implements very complex memory management 
System. Simulation model is shown in Fig. 3. 
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Fig. 3. Simulation model of switch with central queuing 


Memory management in central queuing switch can be 
avoided by simulation of output queuing switch in which 
every buffer has capacity of entire central buffer. Cell 
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processing delay is simulated by transferring cells from 
buffers to cell registers with user defined delay at cell rate 
equal to rate of the corresponding outgoing link. In case that 
this delay is neglectable, it is recommended to disable the 
mechanism by allowing link to take cells directly from 
buffer. 


V. DEMONSTRATIONAL SIMULATIONS 


As demonstration of event driven simulator performance, 
we first present results of two simulations in order to 
compare performance of switches with outgoing and central 
queuing strategy. Both switches have been under identical 
traffic load. Traffic generators were VBR sources of video 
traffic. Fig. 4. shows comparison of number of cells 
discarded by both switches. Total SCR flow loaded on 
switches was 210 000 cells/s. Switch had three STM-1 
ports. Traffic from first two incoming ports was directed to 
the third one. There was no traffic incoming from the third 
port. Total buffer capacity was 3000 cells, ie. 1000 cells 
per port in case of switch with output queuing and 3000 
cells capacity in case of central buffer. 


No. of cells discarded 


44000 
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30000 


20000 
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Fig. 4. Number of cells discarded by output and central queuing switches. 
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It is very important to emphasize that both switches have 
equal buffer sizes. Due to the better memory utilization of 
switch with central queuing, number of cells discarded is 10 
times smaller than in case of switch with output queuing, 
which significantly improves traffic performance. 


In the end we would like to demonstrate obvious difference 
between traffic characteristics of Poisson process modeled 
traffic and traffic synthesized with self-similar process [2]. 
Both traffic traces on Figure 5 are obtained by AS VBR 
applications. Mean value of both sources was 180 000 cell/s 
and transmission system used was STM-1. Hurst parameter 
of self-similar traffic generator used was H = 0.9. Observed 
on relatively small time scales, Poisson traffic shows almost 
identical behavior as self-similar traffic. As the time scale 
grows, Poisson traffic intensity "averages" around mean 
value, whereas self-similar traffic remains bursty. In this 
case time period of 10 seconds can be assumed a relatively 
large time scale. 
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Fig. 5. Comparison of Poisson ON/OFF and self-similar traffic 
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VI. CONCLUSION 


Simulation-based network design has become the norm for 
today's telecommunication technology. Unavailability of 
upgradeable or suitable simulation-based design tools 
forces network designers to learn simulation techniques. 
Event driven is the simplest simulation technique, but it is 
suitable for fast and easy modeling of communication 
processes. Intention of this paper was to introduce event 
driven simulation technique to readers and to give several 
tips and tricks as ideas for optimization of reader's future 
simulators. AS is upgradeable simulator with purpose to 
serve as a good simulation basis for further upgrading with 
new network components and traffic generator models. AS 
concept was introduced in order to present capabilities and 
potentiality this simulator has. AS is simulator in progress 
and in the close future it will be capable of carrying much 
more serious simulations including all OSI stack layers. 


REFERENCES 


[1] Nada Golmie et al., "The NIST ATM/HFC Network Simulator. 
Operation and Programming". U.S. Department of Commerce, 
National Institute of Standards and Technology, December, 1998. 

[2] W.Leland, M. Taqqu, W. Willinger, and D. Wilson, "On the 
Self-Similar Nature of Ethernet Traffic", Proceedings of ACM 
SIGCOMM '93, pp. 183-193, 1993. 

[3] M.W. Garret and W. Willinger, "Analysis, modeling and generation 
of self-similar VBR video traffic," Proceedings of ACM 
SIGCOMM '94, pp. 269-280, 1994. 

[4] V. Paxton, "Fast Approximation of Self-Similar Network Traffic", 
Laurence Berkeley Laboratory, University of California, Berkeley, 
LBL-36750, April 1995. 

[5] M. de Prycker, "Asynchronous transfer mode: solution for broadband 
ISDN", 3“ ed., Prentice Hall Inc., Englewood Cliffs, NJ, 1995. 


10 Mediterranean Electrotechnical Conference, MEleCon 2000, Vol. I 205 


On blocking probability equalisation in switching 
networks with rearrangement and reservation 
mechanisms 


Maciej Stasiak, Mariusz Glabowski, Robert Pekal, Piotr Zwierzykowski 


Abstract—In the paper methods for increasing traffic load 
capacity of switching networks servicing integrated traffic 
have been considered, i.e. bandwidth reservation mechan- 
ism and rearrangement mechanism. Methods for calculating 
the upper and lower bound of blocking probability in TST 
networks have been presented, which correspond to the min- 
imum and maximum traffic capacity of the network. Calcu- 
lation results have been compared with simulation data. 


I. INTRODUCTION 


The assignment of constant bit rates to the VBR (Vari- 
able Bit Rate) traffic sources enables the evaluation of 
traffic characteristics of switching systems in the B-ISDN 
(Broadband Integrated Services Digital Network) network 
by means of methods worked out for the multi-bit rate 
circuit switching [1,2]. In multi-rate models resources re- 
quired for the connections of particular classes are the mul- 
tiplicity of certain value of bandwidth, the so-called Ba- 
sic Bandwidth Unit (BBU). In circuit-switched networks, 
the bandwidth unit is well defined as time slot or chan- 
nel. While constructing multi-rate models for broadband 
systems, it is assumed that BBU is the greatest common di- 
visor of the equivalent bandwidths of all call streams offered 
to the system [2]. 

The quality of broadband services can be improved by 
different methods, e.g. by bandwidth reservation as well 
as by introducing rearrangement mechanisms to switching 
nodes. Bandwidth reservation in multi-service switching 
systems was the subject of many analyses, e.g. [2-10]. In [3] 
an approximate method of calculating the occupancy dis- 
tribution in the full-availability group with reservation was 
quoted and in [7-9] reservation algorithms for multi-service 
switching networks with point-to-point and point-to-group 
selection were formulated. Rearrangement mechanisms for 
one-slot switching systems were the subject of many exam- 
inations and analyses, among others in [11-14]. Practical 
application of these mechanisms is, however, profitable first 
of all in networks servicing multislot connections. 

The aim of this paper is to evaluate the influence of the 
proposed reservation and rearrangement mechanisms on 
blocking probability in switching networks. In Section 2, 
the reservation mechanism for the full-availability group 
with reservation has been presented. In Section 3, a re- 
arrangement algorithm for TST networks has been pro- 


_Institute of Electronics and Telecommunications, Poznan Univer- 
sity of Technology, ul. Piotrowo 3A, 60-965 Poznan, Poland. 


Mariusz Gtabowski is supported by the Foundation for Polish Sci- 
ence. 


206 


posed. In Section 4, a method of calculating the upper 
bound of blocking probability [15] (TST network without 
reservation) has been quoted and in Section 5, a method 
of calculating the lower bound of blocking probability has 
been proposed (switching networks with ideal rearrange- 
ment mechanism). In Section 6, the calculation results 
have been compared with the simulation data. Section 7 
concludes the paper. 


IJ. FULL-AVAILABILITY GROUP 


Full-availability group is a model of separate transmis- 
sion link. Let us consider the link with a capacity of V ba- 
sic bandwidth units. The system services m independent 
classes of Poisson traffic streams having the intensities: 21, 
A2,---;Am- The holding time for calls of particular classes 
has an exponential distribution with the parameters: 11, 
[12,--+5/lm. Thus the mean traffic offered to the system by 
a Class 7 traffic stream is equal to: 


(1) 


A class 7 call requires t; basic bandwidth units to set up 
a connection. 

The service process in the full-availability group can be 
determined by multi-dimensional Markov process. This 
process can be approximated by the one-dimensional 
Markov chain, which is described by the so-called gener- 
alised Kaufman-Roberts recursion: [2,3, 10, 16, 17]: 


a; = Ai / 1G 


nP(n) = Se ajtioi(n — ti) P(n — ti) 


s=1 


(2) 


where P(n) is the state probability (ie. the probability 
that there is n busy BBU in the system) and o;(n) is 
the conditional probability of passing between the adja- 
cent states of the process (i.e. the probability of admission 
of class i call to the service when the system is found in 
the state n). 

In the case of the full-availability group the o;(n) is equal 
to one for all states in which the system can be found: 


(3) 


The blocking probability for the class i stream in the full- 
availability group can be expressed by the formula: 


oi(n) =1 for eachn 


"4 
bi)= S$) Pn) 


n=V—-t;4+1 


(4) 
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In the systems with reservation the reservation threshold 
Q for each traffic class is designated. The parameter Q de- 
termines the borderline state of a system, in which servicing 
calls of a given class is still possible. All states higher than 
Q belong to the reservation space R = V — Q, in which 
class i calls are blocked. According to the equalisation rule 
[2,3], the blocking probability in the full-availability group 
will be the same for all calls if the threshold Q for all traffic 
classes is identical and equal to the difference between the 
group capacity and the number of BBU required by a call 
of maximum demands (tm = tmax): 


Q=V —tm (5) 
The occupancy distribution in the full-availability group 
with reservation [3] is determined by recursion (2), in which 


probabilities of passing are equal to: 


0 for n>Q 


for 1< 2 (6) 


oi(n) = { 
According to (2), (6), blocking probabilities for calls of 
all classes can be calculated as follows: 


V 
i)= > Pin) (7) 


n=Q+1 


III. REARRANGEMENT MECHANISM IN TST NETWORKS 
A. Blocking in TST networks 


Let us consider a 3-stage TST (Time-Space-Time) 
switching networks (Fig. 1). In the stages 1 and 3 of 
switching networks there are time switches that execute 
time switching process. In the second stage there is N link 
space switch that executes space switching process. Us- 
ing double-buffered time switches with implemented TSSI 
mechanism (Time Slot Sequence Integrity) ensures that 
time stages in switching newtworks are non-blocking. So, 
blocking in TST network can be considered as blocking in 
the space switch. Let us assume that one call requires t 
BBU to set up a connection. An internal blocking event 
occurs when both incoming and outgoing links have the 
sufficient number of idle slots to set up the connection but 
the positions of these slots are mismatched at least for one 
of the requested slots. An external blocking event occurs 
when a given incoming or outgoing link of the space switch 
has not the sufficient number of idle time slots. Because 
internal and external blocking events are independent, the 
total blocking probability is the sum of internal and ex- 
ternal blocking probabilities. 


B. Rearrangement algorithm 


The application of rearrangement mechanisms in switch- 
ing networks can lead to decrease the value of blocking 
probabilities of particular or all traffic classes, and simul- 
taneously, to increase the value of carried traffic. 

The rearrangement mechanism in TST switching net- 
works consists in shifting of servicing call in such a spe- 
cific way that makes it possible to set up a new connection 
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Fig. 2. Blocking event in TST network 


between given incoming and outgoing links of space switch. 
Let us consider a class i connection (t; = 2) between the 
incoming link 1 of space switch and its outgoing link 2 
(Fig. 2). It was assumed that each link has a capacity of 
V time slots. The considered call between links 1 and 2 is 
blocked because there is only one free and available time 
slot which appears at the same time position in both links 
(slot C). 2-slot connection between the given links 1 and 
2 may be set up if certain existing connection, e.g. from 
time slot A in incoming link 1, can be rearranged, i.e. it 
can be carried by another idle time slot of the given link. 
For example, a call in incoming link 1 in time slot A can be 
rearranged to time slot D, because slot D in both incoming 
and outgoing links (inlet 1 and outlet 3) is idle. 

The structure of rearrangement algorithms should pre- 
vent the rearrangement of servicing calls to slots which 
can be directly used to set up the blocked call (e.g. slot 
C). There can be several structures of rearrangement al- 
gorithms. If connections are shifted only in one link, this 
is known as one-sided rearrangements. If connections are 
rearranging in both incoming and outgoing links, this is 
known as two-sided rearrangements. Moreover, algorithms 
can be split into single-attempt and multi-attempt ones. 
Single-attempt rearrangement algorithm makes only one 
attempt over one time slot to rearrange a connection, as op- 
posed to multi-attempt algorithm, where several attempts 
can be made over all time slots. In multi-attempt al- 
gorithms the number of permitted rearrangements depends 
on many parameters, e.g. on the maximum number of 
lacked idle time slots required to set up a new connection. 
The different rearrangement algorithms for TST switching 
networks have been investigated by the authors. It has 
been proved that the most effective is two-sided rearrange- 
ment algorithm. 

Let us consider the two-sided rearrangement algorithm. 
Let us assume that a given call requires t time slots to set 
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up a connection (1 < t < V), where V is the capacity of 
a link. Let us assume that only w idle and available slots 
(w < t) could be found in the incoming and outgoing links 
appointed to set up the given connection. If the difference 
(t — w) is larger than the number of permitted rearrange- 
ments a, then the call is blocked. If there is a possibility 
to set up a connection, the control system selects certain 
number of time slots x (x < @) in incoming and outgo- 
ing links. Then the control system rearranges (shifts) the 
connection in both incoming and outgoing links to make 
appropriate number of time slots (t — w) available to set 
up the given connection. If (t — w) slots could be found 
in x attempts, the call will be accepted by the switching 
network. In other case the call will be blocked. 


IV. UPPER BOUND BLOCKING PROBABILITY [15] 


The calculations of the upper bound of blocking prob- 
ability in TST networks are carried out on the assump- 
tion that no mechanisms decreasing internal blocking have 
been introduced into the networks. Such calculations can 
be evaluate on the basis of the method [15], derived for 
TST switching networks with mixture of different multi- 
rate traffic streams. 


V. LOWER BOUND BLOCKING PROBABILITY 
A. Switching network without reservation 


Let us assume that a mechanism which completely elim- 
inates internal blocking probability has been applied in the 
considered switching network. 

Let us assume that z and y denote the number of busy 
slots in incoming and outgoing interstage links, respect- 
ively. P(x) and P(y) represent respectively the probabilit- 
ies of z and y time slots being busy. Let us assume further 
that the probability of calls appearance on each inlet and 
outlet of TST switching network is the same and it is in- 
dependent from other links. 

Setting up ¢;-slot call means that the switching network 
has to pass from the state (x,y) to the state (x +w,y+w) 
if the the following conditions are fulfilled: 

r<V—-t, y<V—-t; (8) 

The occupancy distributions P(x) and P(y) can be 
approximated by the occupancy distribution in the full- 
availability group (4). Thus the blocking probabilities of 
class i call in incoming link — Pb; and in outgoing link — 
Pb, can be expressed as follows: 


V 


>> Ply). 


z=V—ti41 


"4 
Poi;= S° P(x), Pho, = 


z=V—ti41 


(9) 


Total blocking probability for class i call can be determ- 
ined by the following formula: 


B; = 1-— (1 — Pbi;)(1 — Pbo;) (10) 


Let us notice that the probabilities Pbi; and Pbo; (9) 
are mutually dependent because the blocking probability of 


one incoming link influences the amount of traffic offered 
to outgoing link. It means that the given outgoing link 
is offered only the part of traffic which is not blocked in 
incoming link. Thus, using the fixed point methodology 
[18], the formulae (9) can be rewritten as: 


Pbi; = f [ai(1 — Pbo;)] ’ Pbo; = if [a;(1 = Pbi;)| (11) 


where: f[a] - blocking probability in the full-availability 
group determined by the Kaufman—Roberts recursion 
((2),(3),(9)). The occupancy distributions P(x) and P(y) 
are identical and consequently Pbi; = Pbo; = Pb;. Thus 
the blocking probability for class i call can be calculated 
iteratively according to the following formula: 

Pot) = f [aa - Pr\”)| (12) 
where: Pps? - blocking probability for class i call after the 
kt» step of iteration. The iteration process is repeated until 
the given accuracy € is obtained: 


k k+1 k 
|Pos” — pelt? Pol") <¢ (13) 


B. Switching network with reservation 


Let us assume additionally that the reservation mechan- 
ism (described in Section II) has been applied to both in- 
coming and outgoing links of TST switching network with 
rearrangement mechanism. This means that any new con- 
nection can be set up only if in given interstage links (in- 
coming and outgoing) there are tmax idle time slots: 


&<V —tnsxs  y SV —tex- (14) 


In switching networks with reservation the occupancy dis- 
tributions P(x) and P(y) are approximated by the occu- 
pancy distribution in the full-availability group with reser- 
vation ((2), (6), (7)). Further calculations are carried out 
by the iterative method described in the previous section. 


VI. EVALUATION OF RESEARCH RESULTS 


In order to confirm the assumptions adopted in the pro- 
posed methods, the results of analytical calculation of the 
upper bound and lower bound of blocking probability have 
been compared with the simulation results of TST switch- 
ing networks. 

In Figs 3-4 the calculation results of the switching net- 
work are compared with the simulation results. All the 
results are expressed in relation to the value of total traffic 
offered to a single BBU. The results presented have been 
obtained in the switching networks structured as shown in 
Fig. 1, where f = 256 BBU and N = 8. Two traffic classes 
(t: = 1, to = 2) were offered to the networks in the follow- 
ing proportions: a; : a2 = 6: 1. The following notation 
has been used: LBBP — lower bound of blocking probab- 
ility (calculation results), UBBP — upper bound of block- 
ing probability (calculation results), SNLB — lower bound 
of blocking probability (simulation results), SNRS(z, y) - 
simulation results of switching networks with x and y per- 
mee rearrangements for calls of class 1 and 2, respect- 
ively. 
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Fig. 3 shows the results of blocking probability in TST 
network without rearrangement and reservation mechan- 
ism. The calculations have been performed in accordance 
to the method [15]. Approximation error is inversely pro- 
portional to the number of BBUs required by calls of par- 
ticular classes — the bigger number of BBUs, the greater 
accuracy. In most cases the approximation error does not 
exceed 10% and the results obtained can be considered as 
satisfying in engineering applications. Calculation results 
for the lower bound of blocking probability were also com- 
pared with simulation data for the lower bound of block- 
ing probability, as well as with simulation data for TST 
networks with rearrangement mechanisms introduced for 
both cases without reservation (Fig. 3) and with reserva- 
tion (Fig. 4). In the experiments carried out the two-sided 
rearrangement algorithm was applied (Section III-B). Se- 
lected results shown in Fig. 3 and 4 and simulations carried 
out by the authors demonstrate that the proposed method 
for calculating the lower bound of blocking probability has 
a good deal of accuracy for different parameters of switch- 
ing networks, streams of traffic, and proportion between 
traffic intensities in particular classes. Calculations made 
according to the proposed method are not complicated and 
are easily programmable. 


VII. CONCLUSIONS 


In the paper approximate methods for calculating some 
relevant characteristics of TST networks have been pro- 
posed, i.e. the so-called lower and upper bound of loss 
probability. Especially relevant. is the lower bound, which 
indicates the least possible value of blocking probability 
that can be reached using ideal mechanisms increasing the 
traffic load capacity of switching systems. The influence 
of the rearrangement and reservation mechanisms on the 
traffic load capacity of TST networks servicing integrated 
traffic has been also investigated. It was stated that ap- 
propriate number of rearrangements for calls of different 
classes leads to eliminate the phenomenon of internal block- 
ing, whereas the reservation mechanism enables to equlise 
external blocking probabilities for all traffic classes. 
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Fig. 3. Lower and upper bound of blocking probability in TST 
network with rearrangement, but without reservation mechanism. 
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FPGA implementation of an ATM traffic 
classifier for Quality of Service Management 


S. Holgado, S. Lopez-Buedo, Member IEEE, A.J. Pearmain, Member IEEE 


Abstract— One of the main problems in modern broadband 
networks is the variation of Quality of Service (QoS) that can 
occur. A very common technology used for broadband 
networks is ATM (Asynchronous Transfer Mode). ATM can 
offer QoS guarantees for a number of different traffic classes. 
In an ATM switch it is possibile to discard low-priority cells 
when the buffer is full. This can be done using logic 
implemented on a FPGA (Field Programmable Gate Array) 
chip, used because of its flexibility and easy re-configuration. 
This paper presents the FPGA implementation of an ATM cell 
priority classifier, which can be used in QoS management. The 
circuit directly receives a 155 Mbps ATM serial stream from a 
physical medium device, and performs cell delineation, serial to 
parallel conversion and classification according to the cell loss 
priority (CLP) bit. It has been implemented on a Xilinx Virtex 
XCV50PQ240-6 FPGA, occupying 8252 equivalent gates. 
Simulation results show that the circuit can directly process a 
155 Mbps ATM stream. 


Index Terms-- ATM, FPGA, quality of service, high-speed 
digital design, traffic management, high-speed networking, 
CLP, cell delineation. 


I. INTRODUCTION 


One of the problems of pure ATM is the difficulty of 
guaranteeing the quality of service (QoS) [1]. The 
importance of QoS depends on the nature of the service 
being carried. Data transfer normally uses a protocol that is 
robust against cell loss or cell delay. Errors can be detected 
and cells re-transmitted without significant effect on the 
transfer. However, real-time services such as audio and 
video are severely impaired by cell loss or non-deterministic 
delay. An ATM switch always has a finite buffer size and 
this leads to the possibility of cells being discarded when 
the buffer is full. The statistical variation of 
telecommunication traffic bit rates can be very wide, 
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particularly for video, making design of buffers for worst- 
case bit rates uneconomic. 


One feature has been included in the ATM cell header 
definition to help in managing quality of service. This is the 
priority bit. Users of services that are severely impaired by 
cell loss or delay will be willing to pay higher charges for a 
higher quality of service. Improved quality can be achieved 
by setting the priority bit. When this bit is set the cell can be 
given special treatment so that other cells are discarded or 
delayed instead of the prioritised cell. 


In an ATM cell, the priority bit is placed in the CLP (Cell 
Loss Priority) field inside the cell header (Fig. 1), and it is 
therefore necessary to look at the bit number 32, counted 
from the beginning of the cell, to know the priority of the 
cell. The CLP field allows cells with higher priority to be 
separated from those with lower priority. If the CLP field is 
set to 1, the affected cell is assigned a low priority; if set to 
zero the cell is given a high priority. 


The objective is to use the priority bit of an ATM cell as a 
way of discarding cells in the event that the transmission 
capacity of a connection is exceeded or when congestion 
occurs in a network switch. A FPGA (Field Programmable 
Gate array) chip has been chosen to implement the logic 
required. 


Over the past 15 years, FPGAs have enjoyed phenomenal 
growth in both size and speed. The Xilinx Virtex Series 
FPGA family [2] has densities ranging from 50,000 to 3.2 
million system gates. The key advantages of FPGAs over 
ASICs (Application Specific Integrated Circuit) and DSPs 
(Digital Signal Processing) are their unique combination of 
performance and flexibility. In a FPGA it is possible, by just 
changing the configuration file, to change the circuit 
implemented on it. The device chosen to implement the 
circuit is a Xilinx Virtex XCVS0PQ240-6 FPGA. 


The high-speed features of the new generations of FPGAs 
allow direct connection of the chip to the physical medium 
devices. In this paper, a simple yet useful application is 
presented that takes an ATM stream and performs cell 
delineation and classifying, outputting the cells in byte-wide 
format. It accepts a 155 Mbps ATM stream, such as the one 
provided by physical devices like the CY7B951 [3]. 
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II. CIRCUIT DESCRIPTION 


A. Basic ideas 


The FPGA directly interfaces to the physical medium 
device. This circuit typically performs electrical adaptation 
and clock extraction, outputting two signals: data and clock. 
These signals could be directly connected to the FPGA, but 
in most cases use PECL levels that are supported by few 
FPGA families. This problem can be easily overcome using 
a level translator such as [4], which has a 160 MHz 
bandwidth, enough for this application. 


Once the data and clock signals reach the FPGA, the main 
problem that can arise is clock skew. Working with a 6.5 ns 
clock period, a clock buffer delay of around | ns (typical of 
the chosen family of devices) may appear as a serious threat 
to the overall timing of the circuit. Fortunately, this FPGA 
family has on-chip clock DLLs (Delay Locked-Loops), 
which allow the clock distribution delay to be virtually 
eliminated. Then, the only signal affected by the internal 
FPGA delays will be the data, thus avoiding any possible 
setup violation problem, because the changes in this signal 
will always follow the clock edges. 


Before looking for the CLP field, it is necessary to perform 
the cell delineation to determine the beginning of the cell. 
Cell delineation is the process that allows identification of 
the cell boundaries [5]. The method followed for 
performing this step is explained in the next section. 


B. Cell delineation 


The way in which the beginning of the cell is detected is 
calculation of the HEC (Header Error Control) field and 
comparison with the one in the cell that the circuit is 
receiving. 


The HEC field, inside the header of each ATM cell (Fig. 1), 
is calculated using a CRC (Cyclic Redundancy Checking) 
code. We know that FPGAs can implement encoder and 
decoder architectures for cyclic codes. The conventional 
implementation of both CRC generation and detection 
circuits is a simple linear binary machine, i.e., a shift 
register connected with feedback through exclusive-or gates 
[6]. The same algorithm has been used previously with 
Reed-Solomon Encoders [7]. 


The method is based on the correlation between the first 
four bytes of the cell header and the HEC field (Fig. 1), 
which is the last byte of the header. The HEC value is 
defined as the remainder of the product x* multiplied by the 
content of the header excluding the HEC field (i.e. the first 
four octets), divided by (modulo 2) the generator 
polynomial x*+x?+x+1, to which the fixed pattern 
'01010101' will be added (modulo 2). This algorithm has 
been implemented by hardware using a shift register circuit, 
as explained above (described in [6]). 


Path Identifier (VPI) 
Path Identifier (VPI) Channel Identifier (VCI) 2 
Channel Identifier (VCI) 
Channel Identifier (VCI) | Payload Type (PT) 


Header Checksum (HEC) 


CLP = Cell Loss Priority 


Fig. 1. Structure of cell header for ATM cells 


An 8-stage linear feedback shift register performs encoding. 
Figure 2 shows the encoder architecture used in our 
particular case and as the generator polynomial is 
x*+x’+x+1, just elements 0, 1, 2 and 8 are different from 
zero. 


Fig. 2. Architecture for the ATM cell header CRC register circuit 
using the generator polynomial x*+x?+x+1 


The encoding operation is carried out as follows: With the 
control of signal turned on, the 32 first bits of the ATM cell 
header are shifted into the circuit and simultaneously into 
the communication channel. Shifting these 32-bits into the 
circuit from the front end is equivalent to pre-multiplying by 
the factor x®, As soon as the complete message has entered 
the circuit, the 8 bits in the register form the remainder. 
These bits are then the HEC field. After this it is necessary 
to break the feedback connection by turning off the control 
of signal. The last step is to shift the HEC digits out and 
send them into the channel. 


All these operations are represented in the box called cell 
delineation in figure 3. 


C. Complete Circuit Design 


The complete circuit (Fig. 3) accepts a serial ATM stream 
and a clock signal. The first step is to perform the cell 
delineation. The process is repeated eight times [5] to 
ensure that the boundaries of a cell are well defined. After 
that the incoming serial bit stream is paralleled into 8-bit 
words, so that the clock rate required to process the bit 
stream is decreased. At the same time, as can be seen in 
figure 3, the circuit searches for the CLP field inside the cell 
header. A 32-state counter is used for this purpose. The cell 
is stored on 8-bit data registers while a search is made for 
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Fig. 3. Circuit architecture 


the CLP bit. The output is split in two ways depending on 
the CLP value. When this field has the value “0” the output 
is sent to the ATM stream priority line, and when the value 
is “1” the output is sent to the ATM stream no priority line 
(fig.3). Two additional outputs are obtained, depending on 
the priority, when the cell is valid. The Byte Clock output is 
just a reference used to synchronise the output of the cell 
classifier block. 


II]. CIRCUIT PERFORMANCE 


The basic features of the FPGA circuit are summarised in 
Table 1. The 171.70 MHz of maximum operating frequency 
shows that the circuit is suitable for application in 155 Mbps 
ATM networks. The design also features a small size, using 
roughly 25% of the resources of the selected FPGA. This is 
mostly achieved by using the simple bit-serial schema 
described in Section II for the CRC encoder. Typically, a 
high speed application like this one would have needed a 
faster parallel implementation of the CRC decoder [8], at 
the cost of much higher requirements of FPGA resources. 
Fortunately, the high performance of the new families of 
FPGAs makes it possible to use a more space-efficient serial 
implementation. 


TABLE I 
SUMMARY OF THE CIRCUIT FEATURES 


FPGA Type Virtex XCV50PQ0240-6 
171.70 MHz 


NumberofCLB slices 91 
Number of Flip Flops 
[Total equivalentgates [8252 


Delay 5.84 ns (4.78 ns delay 
| + 1.04 ns setup) 


212 


| 
| 
eee Cell 
Classifier 
CEP 
fetion Cell valid, no priority 


Byte Clock 


ATM stream, prority 


ig Cell valid, priority 


ATM stream, no priority 


IV. CONCLUSIONS 


A standard FPGA has been used to successfully implement 
a circuit that classifies ATM cells according to the Cell Loss 
Priority Bit in the cell header. The cell is directed to a 
separate output depending on the setting of this bit. The 
circuit has been shown to operate satisfactory above the 155 
Mbps required for ATM systems. 


V. REFERENCES 


{] Pitts, JM. and Schormans, J.A, An Introduction to ATM Design 
and Performance, Wiley, 1996. 

(2] Virtex 2.5V Field Programmable Gate Array Data Sheet v 1.7, 
Xilinx, Inc., San Jose, California. 

(3] CY7B951 Data Sheet, Cypress Semiconductor, San Jose, 
California. 

[4] SY100ELT23 Data Sheet, Micrel Semiconductor, San Jose, 
California. 

[5] ITU-T Recommendation 1.432.1, 1996. 

(6] Daniel J. Costello, Error Control Coding: Fundamentals and 
Applications, 1** Ed. Shu Lin, Texas A & M University. 

(7] E.R. Berlekamp, “Bit-Serial Reed-Solomon Encoders” 
Transaction on Information Theory, Vol. IT-28, n° 6, November 
1982. 

(8] Tomoko K. Matsushima, Toshiyasu Matsushima and Shigeichi 
Hirasawa, “Parallel encoder and decoder architecture for cyclic 
codes”, JEICE Trans. Fundamentals, Vol. E79-A, n° 9, September 
1996. 


10” Mediterranean Electrotechnical Conference, MEleCon 2000, Vol. I 


A modular VLSI architecture for real-time 
processing of video streaming applications 
over ATM networks! 


Gr. Doumenis’, G. Konstantoulakis”, G. Korinthios’, G. Lykakis”, D. Reisis’ and G. Synnefakis” 


Abstract-This paper presents a VLSI architecture 
specifically designed to support emerging video/data 
communication/multiplexing applications as a 
video/communication controller in an ATM network. Using 
a shared memory with efficient interconnection the design 
can accommodate either a processing system as a 
peripheral which can store data and perform specific 
operations on these, or a switching system as a buffer with 
real time processing and multiplexing capabilities. The 
architecture consists of three (3) processing modules, a 
shared memory with four (4) banks and two (2) 
input/output modules. 

The design can operate at a data transfer speed of 
622Mbit/sec. Per flow queuing, implemented in shared 
memory, supports shaping and multiplexing operations on 
data, on a process/connection basis. The design is capable of 
multiplexing thousands of flows performing shaping 
according to a traffic profile for each flow. Traffic profiles 
could be either static according to negotiable network 
parameters or dynamic in applications, such as video 
transport, where a connection presents diversity in 
bandwidth allocation requirements. Dynamic update of 
traffic profiles involves real-time measurements and 
calculatiuns of incoming packets. These operations are 
efficiently mapped in hardware/software components, 
inside the processing modules, implementing the respective 
algorithms. Performance trials of the design have indicated 
that the behaviour of the controller under heavy load 
improves with the use of the appropriate algorithms. 

The architecture embeds both the processing and the 
memory modules, thus producing a true system-on-a-chip 
solution. 

Index Terms-ATM, quality of service, video transport, 
traffic profile, VLSI 


Ie INTRODUCTION 


The design of today’s high performance communication 
and processing systems targets mainly at products 
supporting image applications. Developing these 
powerful processing systems and broadband networks 
emerges as an answer to the introduction of 
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video/multimedia services. This is due to the large 
amount of bandwidth needed to process and/or transport 
real-time digital video. The use of compressed video 
imposes the need for the enhancement of 
processing/transporting nodes with subsystems, which 
will be able to transfer compressed video from disk 
arrays, store and multiplex video streams, perform high 
level flow control, support the transportation of 
thousands of concurrent connections/streams and ensure 
the quality of the offered service. The design and 
development of devices used as video and/or 
communication controllers comes as a solution to the 
aforementioned challenges. These architectures provide 
real time processing [11], [16] and data storing 
capabilities to accommodate these requirements. 

The focus of this paper is on a parallel VLSI architecture 
which can support video control functionality. The entire 
system is realized at .25um CMOS technology, with the 
I/O performing at 622Mbits/sec and the internal 
communication rate at 2.5Gbit/sec [3]. 


II. ARCHITECTURE REQUIREMENTS 


The proposed architecture functioning as a video stream 
controller includes a straightforward mapping of video 
stream properties [5], [8], [15], [17], [18], [21]to 
transport layer procedures. 

In the proposed schema it is possible either to use 
prediction algorithms [2] or determining video activity 
in real time [2], [7], [19], [20], [22]. When the Video 
Controller operates after MPEG encoders, the 
inter/intraframe distances are accordingly adjusted in 
order to succeed the appropriate quality. The Controller 
should preserve or correct (when it operates as an interim 
node) the inter/intraframe distances while introducing 
the MPEG frames to the ATM Adaptation Layer 5 
(AALS). 

Computations are performed in the processing modules 
of the network device, which in turn, can be implemented 
as application specific hardware components when 
simple real time processing is required (i.e. address 
recognition). Otherwise, computations are realized as 
S/W tasks running on a CPU when complex, but not 
strict real time processing is required (i.e. the 
establishment of a new connection and the subsequent 
update of: Address and Shaping Look Up Tables, 
Queues). 

The memory module has to support the data storing and 
retrieval operations of raw data, the address 
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manipulation, the management of a large number of 
queues [10], [12], [13], [23] and the handling of shaping 
and/or transcoding [6]. As a consequence of the variety of 
the above operations and also of the requirement to 
perform in parallel a subset of these accesses emerges the 
need of involving a memory manager, which will be able 
to accommodate distinct requests from all the processing 
and I/O modules in minimal time. 

High bandwidth interconnection, among the modules of 
the streaming device, is required to support the high rate 
of the physical links and the data movement operations of 
the employed mux/demux techniques [9]. 

The modules have to exchange data and control as well. 
It is required then to allow data and control flows to 
utilize the same interconnection paths in order to 
facilitate synchronization and keep the cost of the device 
low. Each module then, will have enhanced functionality 
of the interface to the interconnection network to support 
the identification of data and control flows. In most 
cases, multicasting should be utilized for the 
simultaneous delivery of data to the memory and the 
processing modules. Processing modules forward control 
information (commands) to other modules through low 
bandwidth control flows. The interconnection network 
should implement bi-directional paths among modules 
allowing interaction among these. Hence, a generic 
requirement for the interconnection network is to allow 
all possible connections among modules to let each 
application to define the valid data and control flows and 
their respective reserved bandwidth according to its 
specific requirements. 


Ill. ARCHITECTURE 


The system comprises of the following modules: 

1. IO (Input/Output Module 1) and IO2 are bi- 
directional input/output interfaces. They convert the 
incoming data flow into internal data structures and 
the internal data struciures into outgoing flow. The 
controller architecture poses no restrictions on the 
number of external component interfaces nor to their 
nature. ; 

3 CPUs (Processing Modules)’. The first of these 
processors (Connection processor) will perform 
packet classification and connection handling. The 
second (Management processor) will be involved 
with the component management and has also an mP 
interface to communicate the component with the 
entire system control. The third processing unit 
(Shaper processor) handles the flow shaping, QoS of 
the flows and multiplexing. 

M’ (Memory Management Module) includes the 
memory and its controller. It also provides the 
interface for memory expansion. 

The interconnection of the modules is accomplished via a 
simple interconnection network (IN), able to support 


ho 


a 


* Within the IN-RAM project a CPU module is realized 
as a ST-20 RISC processor 


CREW (Concurrent Read Exclusive Wnite), which is 
designed as a shared bus and realized as a MUX- 
DEMUX combination. Each module has the ability to 
connect to any other module(s), via the IN. The IN 
sharing by the modules is resolved by a Scheduler. The 
IN with the Scheduler form the Internal Interconnection 
Module (IIM). Each module interacts with the Scheduler 
through a pair of “Request” and “Grant” signals. Each 
module embeds an Internal Interface that is used for the 
communication of the module with the IN. The outgoing 
signals of all modules are multiplexed, and the output of 
the multiplexer is connected to the incoming signals of 
all modules. Hence the IN realizes all possible data and 
control paths among the modules and supports 
multicasting. The multicasting capability of IN is useful 
for applications that require processing on the same data 
from more than one module. 

IO modules may also accommodate physical layer 
functions such as clock recovery, data alignment, serial to 
parallel conversion etc. 

The Management Processor consists of the CPU, local 
memory and the interfaces to the system and to external 
world. The main tasks of the CPU include system 
configuration, usage statistics collection from the M> 
(such as number of queues, number of packets per queue, 
etc.), complex non-real-time algorithm execution and 
host processor communication. 


== Control Bus xternal Memory I/F 


~~ Data Bus 


Figure 1. The IN-RAM architecture. 


The Connection Processor module involves a pipeline 
unit to execute data operations at real time (wire speed of 
the I/O modules). It accomplishes packet-header 
translation, control packet identification, traffic 
descriptor calculations and other related processing 
functions as required by the applications [4], [6]. At the 
input side functions accomplished are: audio/visual 
stream identification, audio/visual component extraction, 
transcoding. At the output side it performs audio and 
video component correlation, audio visual packetization 
and output stream generation (multiplexing). 

The Shaper Processor module performs the flow control. 
It realizes a calendar structure. This is based on a linked 
list of the existing streams/connections, which is realized 
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as a circular buffer within the processor’s local memory. 
Stream control is based on flow information coming 
either from the traffic calculations of the connection 
processor or the management processor. 

The M? is capable of both storing data (per flow queuing) 
and performing address processing. The M’ consists of 
the Memory Manager, the Memory Virtualizer, the 
Internal DRAM and the External Memory I/F. The 
Memory Manager performs the segmentation and the 
pointer maintenance for the multiple queue organization. 
The Memory Virtualizer hides the underlying memory 
structure (internal DRAM, external DRAM, external 
ROM, etc.) and organization (memory size, number of 
banks, etc.) from the Memory Manager, thus providing a 
virtual memory space. The Internal DRAM consists of 
DRAM banks with architecture independent size and 
organization. For applications requiring larger memory 
space, external memory can be added through the 
External Memory I/F. 


IV. VIDEO SERVER APPLICATIONS AND 
PERFORMANCE ANALYSIS 
A. Video multiplexer properties 


The Video and Communication Controller provides vast 
storage capabilities suitable for the buffering of a large 
number of video streams simultaneously. The number of 
multiplexed streams is proportional to the bandwidth of 
the physical interfaces. With 622 Mbits/sec throughput 
per port 200-300 streams can be handled concurrently. 
The storage capabilities, based on weighted fair queuing, 
provide the means for buffering each video stream in a 
dedicated queue inside the Memory Manager Module. 
Hence video transmission can be handled on a per stream 
basis. Furthermore, the real time processing in the 
Connection Processor can be used for bandwidth metrics 
in each incoming stream, in order to calibrate the 
throughput required on the output for each stream 
dynamically. This real time feedback mechanism along 
with the Shaper Processor, that allocates bandwidth for 
each video stream, forms a rate controller that feeds the 
client of the video server with the proper quality of video 
data according to the requirements of each scene. This 
work considers two (2) variations of video multiplexing 
techniques. The first estimates video activity and the 
second measures the length of the currently processed 
Group of Pictures (GOP), both operating in a per video 
stream basis: 

e Estimating video activity: The length of the next 
incoming GOP is predicted utilizing the knowledge 
of the actual length of previous GOPs, the error in 
the last estimation and, if available, additional 
knowledge of the source properties [2]. 

e Real time update: Determining video activity is 
accomplished by measuring the size/length of each 
incoming GOP. The input interface of the 
architecture, while retrieving information regarding 
the identification of the stream, computes the length 


of the current GOP. This computation enables 
calculation of the current mean data rate for the 
specific source. 


B. Architecture Operations and Performance 
Analysis 


1) Mapping of streaming calculations on the 

controller architecture 
The Connection Processor (CP) after calculating the 
requested mean data rate of the last incoming GOP of 
each source updates the traffic descriptor tables of the 
Shaper processor. 
The Connection Processor should determine the 
beginning and the end of each incoming frame. This is 
accomplished by observing the respective bit in the ATM 
cell (AALS packets). The CP integrates timers to 
countdown the time interval of the current GOP and 
registers to hold the number of cells stored of the current 
GOP of each incoming stream. Thus, at the end of each 
time interval, the stored amount of cells determines the 
length of the current GOP, enabling the calculation of the 
mean data rate to be assigned to the respective stream. 


Intervals 


Algorithms 


Update traffic 
descriptor 


GOPi length 


Shared Bus Interface 


Shared Bus 


Figure 2: Update traffic descriptor mechanism 


Figure 2 illustrates the Connection Processor sub- 
modules that dynamically allocate bandwidth to video 
sources. It should be noted here that the 'Model 
Algorithms' sub-module can either perform in prediction 
or deterministic mode. 


2) Performance analysis 
The Shaper processor allocates bandwidth per video 
stream according to the measurements of the Connection 
processor. The bandwidth required per stream’s GOP [2] 
changes according to the activity of the scene [14]. 
Typical bandwidth requirements for low, medium and 
high activity scenes (PAL quality) are 0.07-0.12Mbps, 
0.8-1.3Mbps and 3.1-3.9Mbps respectively 
The operation of the Shaper processor when 300 sources 
are multiplexed has been simulated. The simulations 
performed consider that 10% of the total number of 
videos present low activity scenes, 70% present medium 
activity scenes and 20% present high activity scenes. 
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Hence the video streams’ rates can be divided in three 
different categories, each one consisting of tens or 
hundreds of video streams. Priorities are assigned to 
sources according to their category. Streams that present 
high activity scenes have higher priorities and streams 
that present low activity scenes have lower priorities. The 
aggregate throughput required by all video streams, in 
the performed simulations, is 420 Mbit/sec. Hence the 
utilization of the 622 Mbps physical link is 70%. 

In order to decrease the Cell Delay Variation (CDV) the 
write pointer of the calendar can search backwards as 
well as forward, from the calendar position that must be 
occupied by the id of a stream. The results of this 
technique, named backward search, are also presented in 
the graphs. 

Figure 3 presents the mean CDV for each multiplexed 
video stream. On the x-axis the video streams are sorted 
by their priorities. On the y-axis the mean CDV values 
(in cells) are depicted. The mean CDV represents the 
time jitter from “correct” transmission time of each cell, 
as it is described in [1]. As it is presented in Figure 3 the 
mean CDV values are decreased by a factor of four (4) 
when the backward search is enabled. 

High activity 


Mean CDV per source —_ Low activity 


Medium activity scenes 


Mean CDV (cells) 


KERMOOKNMOOTNOM 
- TODD  NYORDON fe) 
Se ee NON CIONECICN 
Highest geo =ueams Lowest 
priority stream --~-~- Backward search disabled 


Backward search enabled 


Figure 3: Mean CDV of video streams 


Ve CONCLUSIONS 


This paper has presented a generic “system on a chip” 
architecture, capable of performing stream data buffering 
and algorithm/protocol processing. The architecture 
solves a number of problems ranging from 
algorithm/protocol processing at high speed for gigabit 
networks to buffering thousands of connections into a 
single, integrated and silicon efficient component. 
Furthermore, this work has shown that the architecture 
posses the necessary attributes to allow the 
implementation of interworking functions at high speed. 
Last but not least the architecture is technology 
independent, scalable (both in terms of speed and 
memory capacity) and easily implemented as a set of 
independent hardware building blocks. 
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Abstract - Considering the transition of political and economical 
system in the 90’s, with negative influence of political 
instabilities in region, “Croatian Telecom”, as a successor of 
Sormer PTT, is facing greater demands for data services in 
recent years. Evolution of data services in Croatia and their state 
before introduction of ATM based public multiservice network 
are presented briefly in this paper. Consolidation of current data 
networks was one of the first premises for this project. 
Preparation for competition in telecom market enforces the 
introduction of new services and their nation-wide availability. 
Intensive study of user demand trends was done, considering 
services and bandwidth. The need for Frame Relay service, as 
well as customers’ bandwidth demands is dramatically 
increasing. Without dedicated Frame Relay service network, 
“Croatian Telecom” is facing the necessity of building new 
network which has to be open and adaptable for services which 
are yet to come and in the same time has to satisfy current users 
demands. It led us to the multiservice network design based on 
“El premise” and to the distribution of service access points in 
favour of Frame Relay. The principle of Croatian ATM based 
multiservice network designs is described, and target network is 
Shown. This network is considered as the important step towards 
planned introduction of “turn key solution” services for 
corporate customers. Verified results from this project could be 
used as a framework for similar activities in relatively small 
national telecommunication networks in our region. 


Index Terms - data networks, ATM, multiservice network, 
Frame Relay, broadband communications 


I. INTRODUCTION 


The strong orientation of European countries toward an 
open market economy, followed by liberalisation, 
deregulation, and competition in telecommunication field 
has induced the rapid development of telecommunications 
in so-called transition countries. Croatia, a central European 
and Mediterranean country with an area of about 60.000 
km? and about 5 million inhabitants, is an interesting 
example of such a country. After the transition of political 
system in Croatia in 1990 and gained independence in 1991 
the course was set toward the open market economy. The 
development of telecommunications was particularly fast 
regarding total national economical potential and ongoing 
war in Croatia in 1991-95, from both, technical and 
economical point of view [1]. “Croatian Telecom” (HT) is a 
legal successor of “Croatian Post and Telecommunications” 


Croatian Telecom 
Non-voice Services Division, 
Draskovi¢eva 26, HR-10000 Zagreb, Croatia 


(HPT), a state-owned enterprise founded in October 1990, 
in the field of telecommunications. “Croatian Telecom” was 
founded as a joint stock company by the decision of the 
Croatian Parliament in July 1998, and started to operate on 
January 1, 1999. The Croatian Institute for 
Telecommunications was established on January 1, 2000 for 
regulatory functions in the telecommunications area. At 
present, “Croatian Telecom” operates all public 
telecommunications services in Croatia. The 
telecommunications law specifies that until the December 
31, 2002 “Croatian Telecom” has the monopoly on fixed 
telecommunications network, and is the only one permitted 
to provide international telecommunications services. The 
licences are issued for competitors in the fields of mobile 
communications (paging and GSM) and data services. Total 
market percentage of competitors in data services is 
currently insignificant. 


II. PRESUMPTIONS AND GOALS 


The data services (with exception of IP services) were up to 
1999 characterised by the old network technologies (trailing 
significantly behind developed countries), unfavourable 
structure of services (90 % of all data services consists of 
analogue leased lines and 94 % of all connections were 
unmanaged connections), old-fashioned and non-balanced 
price list (stimulating the usage of physical copper pairs). 
Up to 1998 this area was also characterised by poorly 
organised business process (e.g. sales were performed in 
one department, network provisioning and service creation 
in other, and service & network management in regional 
centres). By establishing the separate business unit for data 
and telex services (Non-voice Services Division) the 
preconditions for major improvement in this area were 
founded and the first results of this were shown in 1999. 
For the first time all segments of the data services business 
are gathered in one business unit, which has for the first 
time enabled creation of the meaningful business and 
technological strategy and exploitation in this area. 

The big companies in Croatia are accustomed to running 
their own networks based on leased lines. Besides of the 
unmanaged leased line services, there is also a dedicated 
PSPDN network called CROAPAK, which is basically a 
X.25 network, and it has been operating for more than 10 
years. It was the infrastructure of modern data services in 
the past, but today it is dedicated mostly for low speed X.25 
protocol transfers for specific customers such as registers of 
the National Lottery, cash machines (ATMs), bank desk 
operations, and POS terminals. Number of user connections 
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in CROAPAK network is showing an annual growth of 
around 10%. More demanding customers have migrated to 
the Frame Relay protocol and more powerful CROLINE 
network. 

CROLINE network, in operation since 1996, is basically a 
manageable network of leased lines (TDM) with Frame 
Relay switching capabilities at the same time. Due to the 
growing demand for Frame Relay services the number of 
Frame Relay users surpasses the number of TDM users on 
this network. Currently the network is at its limits and even 
overloaded to some extent Because of the applied 
technology (2 Mb/s backbone links) further extensions are 
not possible and not justified. The network does not support 
bit rates over 2 Mb/s and even the number of 2 Mb/s user 
ports is fairly limited by the overall capacities of the 
backbone. Therefore, during the 1998 planning started for 
introduction of a new network. 

Smady of user demands, as well as trends seen in developed 
countries, brought us to several conclusions: 

- average customers’ bandwidth demands for services 
provided by dedicated networks are almost doubled 
every year, 

- user demands for managed leased Imes are at least 
doubled every year, 

- user demands for Frame Relay service are dramatically 
increasing (Fig. 1). 

- we are witmessmg the migration of IT business 
applications from character-based to windows-based, 
and these applications mcludes more multimedia 
contents (data, voice, images. video), 

- despite leased lines, “Croatian Telecom” does not have 
any data services of 2 Mb/s level and higher, 

- after years of development, ATM is recognised and 
justified at most operators as the most promising 
backbone technology, as well as a service for 
converging voice, video and data into one network, 

- independent analytical sources predicted substantial 
growth of Frame Relay, ATM and IP services at the 
European telecommunications market. 


COOTOM commecters BFR commections <O= Tote! number of connections 


Fig. 1. Number of TDM and Frame Relay comnections in CROLINE 
network af the end of years 1997_ 1998 and 1990 
Considering those facts, “Croatian Telecom™ was facing the 
necessity of buildimg 2 new network. The important 
question was whether to build dedicated network for Frame 
Relay service and get stack with one service or try to go a 


satisfy current users demands [2]. We decided in favour of 
second option and it led us to the broadband multiservice 
network design based on ATM backbone. 

The necessity for building such a broadband multiservice 
network based on ATM was recognised and justified by 
several reasons: 

- consolidation of legacy services and introduction of 
new services on a single network, 

- provisioning of broadband services (on level 2Mb/s 
and higher), 

- offering flexible and customised solutions (due to wide 
spectrum of access interfaces), 

- delivering value added services (multimedia services, 
Internet access, leased line service over ATM, VPN, 
etc.), 

- countrywide availability of data services. 


Ii]. NETWORK AND SERVICE DESIGN 


The “Public multiservice broadband network”, as it was 
named in its project phase, was designed on a two-level 
concept, where the first (upper) level is called "Public ATM 
network” and second is called "User domain" (Fig. 2). 
Public ATM network level consists of ATM core switches 
and ATM edge devices interconnected with dark fibre or 
over an SDH backbone. Data traffic volume and user 
distribution analysis showed us the following distribution of 
traffic and users in 4 regional centres: 

- 60% m Zagreb region, 

- 15% in Rijeka region, 

- 15% im Split region and 

- 10% in Osijek region. - 

This led us to topological structure of core ATM 

that is shown in Fig. 3. It consists of 5 ATM switches that 
are interconnected with STM-~4 (622 Mb/s) and STM-1 
(i535 Mb/s) links and a NMS for the management and 
exploitation of the network. 

User connections access the "Public ATM network" directly 
through the ATM core switch ports or via ATM edge 
concentrators, via the PDH/SDH infrastructure. The core 
switches could accommodate ATM user access on E3 (34 
Mb/s) or STM-1 level. It is rather obvious that there will be 
only a few potential users for such a service (beside 
ourselves — e.g. backbone for IP services) so we do not 
consider it dangerous to connect users at core switches. All 
other user access, at level below E3, has to be made via the 
ATM distributive level. 

Distributive level consists of 30 ATM edge concentrators, 
equipped with up to 80 El (2 Mb/s) interfaces for the 
reception of different kind of user traffic. The purpose of 
these concentrators is: 

- to be flexible in providing interfaces for narrowband 
and medium-band non-ATM and ATM services, 
without consuming core switch slots, 

- grooming of traffic prior to entering the core layer, 

- conversion of non-ATM traffic to ATM traffic by 
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- concentration of the traffic onto the broadband port at 
ATM core switch (via STM-1 up-link). 


PUBLIC ATM NETWORK 


CORE 
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Fig. 2. "Two-level" concept 


Those multiservice devices will start with only half of the 
available capacity (40 out of 80 user ports), with whole 
spectrum of service modules (channelised and non- 
channelised El Frame Relay, structured and unstructured 
El Circuit Emulation and El ATM UNI with IMA 
possibilities). Decision to use half-filled chassis gives us 
flexibility to install additional service modules according to 
the demand for services. 

Besides the basic functional characteristics of ATM 
switches, the network supports: 

- different traffic types: CBR, VBR-rt, VBR-nrt, UBR 

and ABR; 

- quality of service (QoS) parameters that can be applied 

to different kinds of virtual connections; 

- advanced Class of Service (CoS) management; 

- PVC and SVC connections for ATM and Frame Relay. 
Having in mind that user demand for Frame Relay service 
is very strong, the network is able to offer different service 
levels and user adaptivity of Frame Relay service with 
strong monitoring and statistical data collection at Frame 
Relay level and it fully supports FRF.5 and FRF.8 
interworking with QoS. 

Considering our goal to build the network which is open for 
promising services that are not currently in our customers’ 
focus, the target network is capable of supporting MPLS 
which is today a widely accepted solution for IP traffic flow 
over ATM. 

For the design of the second network level, which is in our 
project called "User domain", the “El premise” was 
followed. This premise is established as a result of 
transmission network characteristics that enables El (2 
Mb/s) bandwidth to be easily available at almost every 
potential customer premises in Croatia. “User domain” 
level has to deal with user access at El level and below, for 
services which are not currently available in the whole 
Croatia. The question is raised about users whose current 


bandwidth demands are still far below E1. Despite the tariff 
policy that stimulates El connections, E1 is not fully 
justified bandwidth for a great amount of users and their 
applications. For this purpose, the El multiplexers as well 
as HDSL modems are considered. The multiplexers are 
positioned at the edge of the telecommunication access 
network (mostly at RSS - Remote Subscriber Stage 
locations); according to demands for two and more user 
connections whose aggregate bandwidth does not exceed 
E1. These multiplexers groom n x 64 kb/s into a 2 Mb/s 
stream that is transferred to the port at ATM edge 
concentrator via the PDH/SDH infrastructure. This is, in 
fact, fixed bandwidth transfer from customer premises to 
the ATM edge concentrator (service provisioning point). 
It is obvious that usage of multiplex equipment is not cost 
effective in areas where there is only a single customer 
present. For such a case, the usage of HDSL equipment, 
which provides data transfer at less then or at El level, is 
planned as a "last mile" solution. 


International International 
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Fig. 3. Public ATM network 


The current network offers flexibility to connect different 
users in the means of services, bandwidth and QoS needs. 
In the network design process and in the evaluation of 
equipment available on the market, the modularity of 
equipment and scalability of the total solution were among 
the main preferences. The network could be easily 
expanded with installation of additional user access ports by 
200%, and core switches are capable of further expansion 
of their switching matrix to double throughput value, which 
ensures a non-blocking core ATM network solution. 


ITV. PLANS AND TRENDS 


On the June 1, 1999, the new price list for data services has 
been approved as a result of the new strategy and as a first 
step of its realisation: analogue pairs leased lines were 
removed from the price list — the price list recognises only 
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bandwidths and services, not the physical infrastructure. By 
selling bandwidths instead of physical lines, and by selling 
managed digital connections instead of unmanaged 
analogue connections the intention is to establish the 
control over the usage of the network capacities in respect 
to the forthcoming competition. At the same time, we 
would like to encourage the usage of higher quality and 
higher revenue level services. Considering before 
mentioned explanation of “El premise”, as well as the 
trends for corporate customers to migrate their existing 
networks toward El level, the tariff structure will strongly 
stimulate the users who decide to use El bandwidth. 
Considering the current state of Croatian economy, user 
demand and market prices of customer premises equipment, 
distribution of user access ports in this network is in favour 
of Frame Relay vs. ATM. In fact, on El level and higher, 
ratio of access ports for ATM, Circuit Emulation (CE) and 
Frame Relay service is roughly 1:1:2 (this ratio for user 
connections will be even bigger when we include in 
calculation usage of channelised El FR and structured El 
CE ports). This multiservice approach gives us the ability to 
use the same equipment (chassis) for accommodation of 
multiple services with efficient ATM backbone. In the 
beginning stage, the main purpose of this network is to 
accommodate growth of Frame Relay service users, and to 
introduce ATM services. For the start, we planned to 
introduce CBR and VBR-nrt ATM service classes. In the 
same time we expect from multiservice platform the 
possibility to introduce new services easily and to manage 
multiple services from the same platform, giving us 
improved “time to market” and therefore “time to revenue”. 
Service Level Agreements (SLAs) do not exist now, 
although we see them as an important upgrade in our 
relationship with customers and market value. There are 
some necessary steps before their establishment: dedicated 
personnel in 4 main regional centres; establishment of 
mechanism for responsibility for quality and prompt 
establishment and maintenance of physical leased lines in 
all telecommunication centres; full operation of new 
broadband network and flexible billing system. SLAs are 
foreseen for the second half of year 2000. 

In the year 2000 we also plan to start with development of 
total solutions for customer private networks including 
equipment leasing and maintenance of services on the 
customer premises. This will also be the year of 
introduction of different xDSL technologies (ADSL and 
SDSL) in access network. 


V. CONCLUSION 


Deregulation and liberalisation make today’s 
telecommunications a very open market. New companies 
emerge on this market, and in the segments of very 
profitable services there will be (and are) a great number of 
service providers. The position of so-called traditional 
telecom operators (former monopolies) on the liberalised 
market will be dependent on their investment in broadband 
network infrastructure and services. 
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Full-service operators which tend to be the former telecom 
monopolies and so-called supercarrier alliances are likely to 
have a more varied technology repertoire than for example 
enterprises emerging from ISP background. It is widely 
recognised that traditional telecoms are more ATM 
influenced, because ATM allows them better management 
of different information flows and traffic types (data, video, 
voice). Considering development and transition toward 
ATM networks, telecom operators have to define their 
priorities about migration path, securing investments and 
traditional income from legacy services. 

The multiservice broadband network described in this paper 
offers new broadband services and user interfaces on 
Croatian market in the range from 2Mb/s to 155Mb/s. It 
also supports narrowband data services, integration of 
existing data networks and also existing customer premises 
equipment usage. 

The year 2000 will be the year in which “Croatian 
Telecom” will be in position to support expected growth of 
Frame Relay service and the very start of ATM services. 
Typical Frame Relay customers will be private networks 
with 10 to 30 connections at the speed of 64 to 256 kb/s and 
single or two central locations connected with 2 or 34 Mb/s. 
The pricing scheme stimulates 2 Mb/s access points rather 
than lower speeds, because they are technically simpler for 
operator and most perspective solution regarding the future 
needs of the customer. 

Scalability of defined network elements (easy upgrade of 
ATM switches and ATM edge concentrators), their 
distribution possibilities according to customer demands 
and available modem and multiplex equipment will expand 
availability of advanced data services to almost every 
corner of Croatia. These facts are considered as important 
steps in data services evolution in “Croatian Telecom”; 
which will make our company more open and adaptable to 
user demands and which will at the same time secure our 
investment. 

Future development of telecommunications in Croatia will 
be based on its own experiences as well as those of other 
countries in transition and developed countries. Recent 
related legislative changes as well as changes in the 
ownership structure of “Croatian Telecom” (in October 
1999. Deutsche Telecom purchased a 35 percent share) will 
surely have a significant impact on the future development 
of telecommunications in Croatia. 
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ABSTRACT - The growing demand in broadband services, 
be it for example the Fast Internet, Multimedia Services, 
Video Broadcast Services, calls for the re-examination of the 
technology platforms deployed in a future-proof access 
network. Certainly, the cabled medium to carry such 
services is an all optical Fiber To The Home (FTTH) 
network. Such a high speed access network could 
theoretically provide the complete range of all currently 
known and new services being discussed for the residential 
subscribers and business customers. 


However, due to the cost of deploying such technology and 
due to the immaturity of the technology, incumbents are 
driven towards harnessing existing copper infrastructures to 
the maximum extent possible. In this paper, we present the 
work carried out by the Cyprus Telecommunications 
Authority (CYTA) in the field of broadband service delivery 
over ADSL and highlight critical issues for doing so in a 
field trial environment, whereby multi-vendor equipment 
and multiple Service Provider presence are a reality. In the 
field trial, both Full Rate ADSL and ADSL Lite are 
deployed. 


I. INTRODUCTION 


The real challenge and ultimate goal in the 
communications industry, is for the Network Access 
Providers (NAP) to be-able to provide customers with a 
variety of versatile broadband services, developed in the 
world of Information Technology (IT). Such services 
include Fast Internet, Video on Demand, access to 
Multimedia Libraries, Tele-Conferencing, Tele-Working 
and Tele-Shopping, Interactive Services, Virtual Private 
Networks (VPNs), E-Commerce and many more. Most 
of these applications work with a typical client-server 
model, where the majority of information is sent 
downstream toward the client. Until recently, the 
limitations for delivering broadband services was held 
back by the capability of the technology to communicate 
high bit rates over the copper medium. Copper has 
traditionally been the last mile medium providing access 
to traditional services such as POTs and ISDN from 
homes and offices. With the appearance of Asymmetric 
Digital Subscriber Loop Technology (ADSL) speeds up 
to 8Mbps downstream and 640kbps upstream can be 
achieved over copper (depending on attenuation and 
interference levels). The ADSL technology, therefore, 
targets both the Small Office Home Office (SOHO) users, 
as well as the wider Information Technology (IT) 
consumer market. 


The ADSL technology has recently been standardised by 
ITU-T in two versions, namely the G.992.1 (G.dmt) [1] 
and the G.992.2 (G.Lite) [2]. In addition, a handshake 
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procedure to identify different flavors of DSL has been 
studied under Recommendation G.994.1 (3). 


There are a lot of differences between the G.Lite standard 
and the G.dmt. The G.Lite has been designed with a 
focus on achieving a low-cost DSL, fully interoperable 
amongst vendors and simple to install in order to.achieve 
widespread usage. It is, therefore, of lower complexity 
and operates at lower transmission throughputs. It does 
not require a centralised splitter to prevent the 
interference between the POTs devices in the home and 
G.Lite modems and therefore, does not require in-house 
rewiring. However, the achieved data rate by G.Lite 
depends not only on the length of the local loop (from the 
Central Office to the home), but also on the in-home 
wiring conditions and the connected POTs devices. 
Under good home wiring conditions and loop quality, 
1.5Mbps downstream and 512kbps upstream is 
achievable at distances of around 5km. The G.Lite 
includes features like, idle mode and power management; 
fast retrain procedures; time compressed multiplex ISDN 
crosstalk environment; and interoperability between 
different equipment vendors. 


CYTA currently holds a field trial to test broadband 
service provisioning over low-cost ADSL. The 
objectives of the field trial have been the following: 

1. To investigate the features and availability of 
equipment (especially G.Lite). 

2. To investigate the ‘no truck-roll issue’. This issue is 
crucial for the mass deployment of ADSL. It relaxes 
the need to visit customer premises, in order to 
rewire the customer premises internal network, or to 
configure the modem. 

3. To test the interoperability of ADSL modems (ATU- 
Rs) from various vendors. This issue, related to the 
‘no truck-roll issue’, is also critical as it proves the 
possibility of customers to buy any type of ADSL 
modem and yet.to be able to connect it to the 
Network Access Provider’s (NAP) Digital Subscriber 
Line Access Multiplexer (DSLAM) which could be 
of another manufacturer with a different chipset. It 
should be also possible to manage and configure the 
ADSL modem from the Central Office (CO) 
irrespective of the type of DSLAM. Interoperability 
tests of this type are being carried out by the 
University of New Hampshire [4]. 

4. To evaluate the effects of adding several new ADSL 
lines (or other xDSLs) in the same copper cable. 

5. To investigate the interoperability of DSLAMs and 
Broadband Access Servers (BAS). (Interoperability 
of DSLAMs with BAS is an interesting issue for the 
NAP.) 

6. To integrate and evaluate Dynamic Service 
Selection. This allows the user to dynamically select 
his Internet Service Provider (ISP), Corporate 
Network or other source of service/content. 
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7. To integrate Authentication (user identification by 
password, calling number,...), Authorisation (control 
of access by users to class of service, corporate 
network access management,...) and Accounting 
(billing and accounting towards end-users and 
customer, such as ISP and Corporate, by using 
various profiles e.g. flat, time, volume, services, IP, 
address, ...) using a RADIUS server, for the purposes 
of providing such information to both the NAP and 
the Network Service Provider (NSP). 


Il. FIELD TRIAL SETUP 


The field trial setup is shown in figure 1. Equipment 
from three different vendors is used in the field tal. 
Each vendor provides two DSLAMs and forty ATU-Rs. 
The ATU-Rs are available in four different versions: 
ATM-Forum (25,6Mbps) PC interface, Ethernet 10BaseT 
PC interface (some with router functionality), NIC card 
and USB interfaces. The majority of ATU-Rs deployed, 
are of the Full Rate ADSL type, in accordance with 
G.992.1, whilst the rest are compatible with the ADSL 
Lite G.992.2 standard. The ATU-Rs are installed at 
customer premises using centralised splitters, distributed 
filters or no filters. Also, some of the ADSLs and 
splitters are able to provide ISDN BRA. 


Figure 1 ADSL field trial test setup 
The ATU-Rs are connected via the local loop to the 
splitter shelf at the Central Office. The splitter at the CO, 
splits and diverts the narrowband POTs/ISDN BRA 
traffic to the PSTN/ISDN exchange. The broadband 
traffic is diverted to the ADSL Line Termination Units- 
Central (ATU-C), which terminate the physical-layer 
DSL signals. The ATU-Cs are hosted in the DSLAM. 
The DSLAM has additional functionality for ATM 
multiplexing, traffic concentration, buffering and queuing 
[5]. Also, from the DSLAM, the Quality of Service (QoS) 
associated with service profiles, traffic shaping and 
policing, are carried out on a per customer basis [6]. The 
DSLAMs are further connected to the Broadband Access 
Server (BAS), either directly, or over the broadband 
public transport, or ATM network. 


The Broadband Access Server (BAS) acts as a Local 


Access Concentration (LAC) and a Service Selection 
Gateway (SSG). The BAS offers to subscribers the same 
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behavior in broadband environment as in narrowband 

dial-up Internet Access. The main features of the BAS 

are: 

1. ADSL subscriber traffic aggregation. 

2. Traffic partitioning in Virtual Private Networks 

(VPN) dedicated for service providers/content 

providers and services. Each VPN can be configured 

independently. 

Subscriber service selection during session 

establishment. 

4. Protocol adaptation: supported protocols are 
RFC1483 bridged and routed encapsulation, PPP 
over ATM, PPTP, PPP aggregation and tunneling 
over L2TP. 

5. Network adaptation for offering the proper interface 
to each Service Provider. 

6. Authentication, Authorisation, Accounting, (AAA) 
via the RADIUS protocol in conjunction with a 
Service Management Center (SMC). Implementation 
enables either all AAA requests to be initiated by the 
Network Access Provider, or that all AAA requests 
are forwarded via the RADIUS protocol to the 
Service Providers. 


tad 


The Service Management Center provides: 


1. Customer and services complete management 
2. WPN management 

3. AAA functionality 

4. 


Data collection for statistics and marketing actions 


Ill. PROTOCOL STACK CONVERSION FOR END-TO-END 
SERVICE PROVISIONING 


Once an ATM layer connection is established between 
the customer premises and the service provider network, 
the session setup and release phases at the network level 
can be established using PPP [7], similar to existing 
analog dialup scenarios (for Internet access and 
Telecommuting). With PPP over ATM, essential 
operational functions can be delivered over ATM by 
leveraging the functionalities already provided in PPP 
(authentication, Layer 3 address auto-configuration, 
transport of different Layer 3 protocols encryption, 
compression, billing). 


A. ‘PPP over ATM’ 
This architecture specifies that PPP is carried over ATM 
using ATM adaptation layer 5 (AALS) based on VC 
multiplexing [5,8]. The mapping of PPP over AALS uses 
null encapsulation. When a user initiates a PPP call, data 
1s transported from the user premises over a PVC to the 
BAS (service gateway). After momentarily terminating 
the call, the service gateway ‘extends’ the PPP through an 
L2TP tunnel ending at the target NSP. L2TP allows 
multiple PPP calls to be multiplexed over each tunnel, 
therefore, there is no need to set up a tunnel, or a PVC 
between the service gateway and the NSP for every user. 
Rather, one tunnel over one PVC, is sufficient. More 
tunnels can be established over one or multiple PVCs, in 
order to satisfy different QoS levels, or for security 
considerations. With this model, users can easily 
terminate the PPP session with another provider 
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dynamically, without costly and time-consuming 
intervention of the NAP. 


B. ‘PPP over Ethernet’ 


The PPPoE solution defines a new layer, called PPPoE, at 
the host protocol stack. This layer glues the PPP layer to 
the Ethernet layer. In this solution a PPP packet is 
encapsulated in an Ethernet frame and transmitted to the 
ATU-R. The Ethernet frame is then forwarded over the 
ATM VC until reaching the service gateway. At the 
service gateway, the PPP packet is extracted and 
forwarded over the appropriate L2TP tunnel to the target 
NSP. 


IV. SERVICES TO BE PROVIDED 


The services will be delivered to the customer by 
connection to a Service Provider, Corporate Network and 
the Cyprus University which will act as another Service 
Provider. Selection of the various services is facilitated 
by the Service Selection Gateway. The friendly user shall 
have access to the following services 


A. Service Provider 


1. Fast Internet 

2. Video Conferencing 

3. Video on Demand 

4. Real Time TV Streaming 
pee TV 


B. Corporate Network 


Tele-working 

Business applications exploiting Corporate Databases 
VLAN, VPN applications 

VoIP/VoATM 


nasty ad ae 


C. Cyprus University 
1. Multimedia Services 


QoS Levels, billing scenarios, AAA and security 
mechanisms and market acceptance and penetration at 
various tariff levels for the various services provided shall 
be investigated. 


V. SERVICE PROVISIONING ISSUES 


Several issues have to be considered prior to large scale 
ADSL Full Rate and ADSL Lite deployments. Some of 
these issues have already been extensively covered, such 
as noise and crosstalk environment for ADSL systems 
and xDSL line qualification and testing [9,10]. However, 
there remains to be seen and assessed, problems like 
interoperability, effects of splitterless applications and 
home wiring, especially for G.Lite (which is just 
appearing on a commercial basis). Also, other 
deployment issues for evaluation and testing during the 
field trials have been stated in Section I of this article. 


Following, is a description on the main aspects of testing 
that concern the ADSL access. 


A. Test Specification 


The Field Trials have been concentrating primarily on 
testing two crucial aspects of ADSL Lite, namely 
interoperability and the effects of the splitterless solution. 
The success of ADSL Lite deployment depends on a fully 
standardised technology and complete cross-vendor 
interoperability including interoperability on a chip-set 
level. It depends also on the ability of the ADSL Lite to 
function properly on a variety of customer premises 
wiring without being affected, or affecting, other 
narrowband services on the line. 


Interoperability testing may be defined further as Static or 
Dynamic Testing [5]. 


1) Static Interoperability 
An ATU-C and an ATU-R are statically interoperable if 
they can implement a common and compatible set of 
features, functions and options and can demonstrate 
satisfactorily mutual communication in a benign 
environment (i.e., over a NULL loop with no noise 
intrusions). 


2) Dynamic Interoperability 
An ATU-C and an ATU-R are dynamically interoperable 
if they implement a common and compatible set of 
features, functions and options and can demonstrate 
satisfactorily mutual communication in a real network 
architecture environment as performance test conditions 
are varied and exercised. 


The end goal for the proposed testing is to create 
confidence that successfully-tested products will provide 
reasonable performance when connected to similar 
successfully tested equipment and when deployed ‘in the 
field’ on reasonable copper loops. The tests are covering 
the following: 

1. Physical Layer (ATU-R to ATU-C) - Layer 1 

2. ATM Layer (ATU-R to ATM node) - Layer 2 

3. Installation Deployment and Management issues 


3) Test Conditions 

The conditions affecting the test results are the following: 

1. Loop conditions (length, wire gauge, etc) 

2. Customer premises wiring 

3. PSTN conditions: ringing, ring-tip, battery feed, 
signaling 

4. Co-channel interference (BR-ISDN, HDSL, ADSL 
etc) 

5. Other interference: impulse noise, RFI, etc. 


The above conditions are evaluated (characterised) and 
correlated with the results of measurement of the 
performance of the ADSL Lite, in order to make 
conclusions regarding the dependence of ADSL Lite 
performance on the above listed parameters. Hence, a 
characterisation of home wiring, phone devices used, 
microfilter usage, local loop conditions, chip-set used, are 
also noted with the tests. 
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4) Measurement Parameters 
Measurement parameters include, maximum achievable 
downstream and upstream rates with range, installation 
time required to get the ATU-Rs working at customer 
premises either by the customer or network provider, 
degradation in POTs service, Bit Error Rates (BER), 
ATM traffic shaping, policing and Quality of Service for 
the ADSL traffic. 


At a lower level, measurement parameters may include 

the following: 

Upstream and downstream transmission rates 

Line attenuation 

Noise margin 

ADSL line status 

Transmitted FEC blocks 

Corrected FEC blocks 

Uncorrectable FEC blocks 

Transmitted superframes 

Uncorrectable superframes 

0. Counters for current and previous loss of signal, loss 

of frame, loss of power, errored seconds 

11. Counters for current and previous loss of signal, loss 
of frame, loss of power 

12. Bits/Carrier (DMT) 

13. Interleave delay 

14. Rate adaptation (dynamic rate repartitioning, 
dynamic rate adaptation, and fast retrain) 


HORNA WN 


5) End-to-End Performance Parameters 
To the providers, performance through the network to the 
user is critical. The following parameters are being 
accounted for. 
1. Service Quality 
2. Latency 
3. Bandwidth and Data Rate Availability 


6) Levels of Interoperability 
Interoperability at a particular layer can be evaluated in 
either a static or a dynamic interoperability test scenario. 
1. Physical layer 
2. Interoperability at the TC layer 
3. ATM over ADSL Layer 
4. Higher Layers of Interoperability 
- PPP over ATM on ADSL 
- Frame-based architectures (e.g., FUNI on 
ADSL or HDLC on ADSL) 


B. Examination of Commercial Deployment Issues 


Other issues to be placed under investigation before going 
to a full provisioning of broadband services over ADSL 


are the following: 

1. Service Selection and AAA issues 

2. Billing procedures 

3. Deployment procedures including line evaluation 
4. Multi vendor interoperability of BAS, DSLAM and 


ATU-R 
5. Service delivery and Quality of Services 
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VI. CONCLUDING REMARKS 


A full field trial system setup has been presented based on 
commercially available equipment. The use of multi- 
vendor equipment and multi-Service Provider simulated 
environment allows full interoperability and performance 
testing. It also enables deployment issues to be 
investigated and deployment procedures related to end-to- 
end service provisioning, like line qualification, ADSL 
connection, Service Profiles, authentication, 
authorisation, billing and dynamic service selection to be 
studied and defined. 


Field trial tests regarding ADSL Lite deployment must 
encompass investigation, measurement and evaluation of 
the ADSL Lite to meet the primary design objective: No 
truck roll installation (user installable) meaning no need 
for house re-wiring and no need for a technician to install 
the modem and splitters. Various issues associated with 
testing of interoperability, evaluation of the effect of 
customer premises wiring, test conditions and 
measurement parameters have been highlighted. 
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Abstract— The specification of a cost-effective Full Services 
Access Network (FSAN) has been the target and ambition of 
Telecommunications Operators. With such fully managed 
architecture the Telecommunications Operator can be sure of 
providing the variety of narrowband and broadband services 
including Digital Broadcast services, Video on Demand, IP 
services and traditional telecommunication services including 
POTs, ISDN, ATM and Circuit Emulation. In 1999, 
EURESCOM has launched the BOBAN project which 
considers the evaluation and testing of new broadband access 
technologies in a field, or lab trial environment. A primary 
consideration is the evaluation of FSAN-like systems and the 
interconnection to service platforms for the provisioning of 
symmetric and asymmetric broadband services, including 
point-to-point and point-to-multipoint broadcast services. 
More specifically, the scope of part of this program is to 
evaluate, in the field or laboratory, FTTCab/B/H solutions 
with connections to service platforms. The document presents 
a description of this on-going project. 


I. INTRODUCTION 


Since June 1995, there has been an international initiative 
from telecommunications operators and manufacturers to 
work towards a consensus on the systems required in the 
local network to deliver a full set of telecommunications 
services, both narrowband and broadband [1]. This 
initiative has been called ‘FSAN’ and the broadband access 
platforms compatible to the FSAN requirements, FSAN- 
type. In spring 1999, Eurescom launched the P917 
“Building and Operating Broadband Access Networks- 
BOBAN’ project [2]. Part of this project has been the 
identification of FSAN-type system possibilities, the 
definition of services to be provided and the definition of 
trials and associated tests. Other elements which have been 
under study and related to broadband service delivery were, 
the development of the specification of a broadband general 
purpose cabinet to terminate optical aggregates and host 
VDSL drops, as well as Service and Network Management 
issues related to FSAN-type architectures. 


II. IDENTIFICATION OF FSAN-TYPE SYSTEM POSSIBILITIES 


Figure 1, shows the FSAN-type architecture possibilities. 
In these architectures, fiber is brought nearer to the 
customer. As shown in figure 1, the difference between the 


DR C.A. PANTJIAROS, ACCESS NETWORK DEPARTMENT, 
CYPRUS TELECOMMUNICATIONS AUTHORITY, P.O BOX 24929, 
NICOSIA, CYPRUS. 


architectures is in the positioning of the Optical Network 
Unit (ONU). This leads to Fiber to the Cabinet (FTTCab), 
Fiber to the Building/Curb (FTTB/C), Fiber to the Home 
(FTTH). 


Service PON Head Local Cabinet Kerb 
Node End Node — Exchange 


VBS 
VDSL or ADSL 


ott Kfonu 


OLT 


OLT 


ADSL - Asymmetric Digital Subscriber Line/Loop 
FTTB - Fibre To The Building 

FTTCab - Fibre To The Cabinet 

FTTEx - Fibre To The Exchange 

FTTH - Fibre To The Home 

FTIK - Fibre To The Kerb 


PON - Passive Optical Network 
VDSL - Very high-speed Digital Subscriber Line/Loop 


Figure 1 Fiber To The x architectures 


The reference configuration is stemming from ITU-T G.982 
and is shown in figure 2 [3]. The configuration is based on 
ATM over a Passive Optical Network (PON) in accordance 
with ITU-T G.983.1 [4]. 


Access Network Systern Management Functions 


SNI Service 
node 
ctio: 


(V,)/(V) reference point 


(a) reference point - This refererence point is added to differentiate 
(T3Y(T) reference point the AF from the ONU 


ONU: Optical Network Unit OLT: Optical Line Termination 
ODN: Optical Distribution Network AF: Adaptation Function 


Figure 2 Reference Configuration for an ATM based PON 


The main elements comprising an FSAN architecture are 
the Optical Line Termination Unit (OLT), the Optical 
Distribution Network (ODN), the Optical Network Unit and 
the Network Termination (NT). 


The OLT offers the tributary interface to an ATM network, 
and handles the opto-electronic conversion process towards 
the ODN. The OLT is also responsible for multiplexing 
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and cross-connecting ATM traffic and for managing all the 
PON specific aspects of the ATM transport system. In 
short, together with the ONU, the OLT is responsible for 
providing transparent ATM transport service between the 
User Network Interface (UNI) and the Service Node 
Interface (SNI) over the PON. 


The ONU interfaces over the IFpon (PON interface) to the 
OLT, and to the UNI. The ONU is active and decouples the 
access network delivery mechanism from the in-house 
distribution. The ONU core consists of the ODN interface, 
Service interface, Multiplexer and powering. 


The ODN provides the optical transmission medium for the 
physical connection of the ONUs to the OLTs. The two 
directions for optical transmission in the ODN are identified 
as follows: 

-downstream direction for signals travelling from the OLT 
to the ONU(s) 

-upstream direction for signals travelling from the ONU(s) 
to the OLT. 

Transmission in downstream and upstream directions can 
take place on the same fibre and components (duplex/diplex 
working), or on separate fibres and components (simplex 
working). The ODN offers one or more optical paths 
between one OLT and one or more ONUs. Each optical 
path is defined between reference points in a specific 
wavelength window. The ODN must be characterised by 
optical wavelength transparency, reciprocity and fibre 
compatibility. 


With the exception of FTTH, all solutions for the provision 
of FSAN discussed use some form of high-speed metallic 
access technology for service delivery to the customers’ 
premises. Prime consideration is given to the use of Very 
High-speed Digital Subscriber Line (VDSL) technology [5]. 


The broadband NT provides onward transmission of data 
over the building wiring and an interface to the home 
wiring. Figure 3, show one option for an NT configuration. 


Splitter (only for 
ADSL, VDSL, FTT:) 


Figure 3 ATMF25.6 and 10BaseT interfaces in NT unit 


In the above architecture, the ATM transport protocols at an 
IFpon are described as consisting of Physical Media 
Dependent Layer, Section Layer, and ATM Layer. Physical 
Media Dependent Layer includes the modulation schemes 
for both the upstream and downstream channels, 
electrical/optical adaptation and wavelength division 
multiplexing. Section Layer is responsible for managing 
the distributed access to the upstream PON resource across 
the multiple ONUs. This is a key protocol element and will 
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directly affect the resulting ATM QoS. The Section Layer 
is divided into PON transmission and adaptation sublayers 
which correspond to the transmission convergence sub- 
layer of the B-ISDN in ITU-T 1.321. The ATM protocols 
should see no change in the way they operate over the PON. 


Some of the key characteristics of the FSAN architecture 

are the following: 

e ATM PON (APON) in the ODN in accordance with the 
ITU-T G.983 Recommendation, the ATM Forum 
(RBB. baseline/doc/draft/ BTD RBB 001.00) and as 
discussed in the GX FSAN initiative. 

e Another important requirement is the support of VDSL 
(Very High Bit Rate Digital Subscriber Line) as drop 
technology for the customer access. This enables 
utilisation of copper already invested for in the local 
loop by traditional telecom operators. 

e The interface to the Service Node (SN) is the VB5.x 
interface which allows narrowband and broadband 
integration of services over ATM. Integration of 
traditional narrowband services such as POTs and 
ISDN with broadband is quite important in the case of 
FTTB/Curb/H architectures, or deployment of FTTx in 
green field areas. Such integration eliminates the need 
for both copper and fiber deployments to the point x. 
In the case of the FTTCab architecture, which uses 
VDSL drops to distribute broadband and narrowband 
services, the narrowband services could be initially 
carried to the ONU over the existing copper network 
and combined at the ONU and transported together 
with the broadband signal over the VDSL to the 
customer premises. In this way the FSAN network acts 
as a broadband overlay. Narrowband service growth 
will be managed by either provisioning over the 
broadband overlay, or growth in the old network 
depending on economic factors. Figure 4, shows the 
broadband overlay configuration at the initial stages of 
broadband service deployment in the case of FTTCab. 

e The UNI interfaces, according to FSAN, must support 
ATMEF25.6, Ethernet 10BaseT and Circuit Emulation. 

e Also, the FSAN system must be able to support SVCs 
and Switched Digital Video Broadcasting (SDVB), in 
order to offer a competitive positioning against the 
traditional HFC architectures used for TV/Video 
channel broadcasts. 


Existing E-Side 
and D-Side Cables 


2 Box (or plinth) Box (or plinth) 
Figure 4 Broadband overlay to existing narrowband 
network 
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III. SERVICES IDENTIFICATION 


FSAN systems could theoretically provide the complete 
range of all currently known and new services being 
discussed for the residential subscribers and business 
customers. These services cover a broad range of network 
requirements like bit rate, symmetry/asymmetry or delay, 
and range from video distribution, with varying degrees of 
interactivity, to electronic data transfer, LAN 
interconnection, transparent Virtual Paths, and so on. 


What specific services should be provided is more clear to 
some operators than to others and depends heavily on the 
particular regulatory conditions of each operators’ markets, 
as well as on the own market’s potential. How these 
services are delivered in a cost effective way is a function 
not only of legal conditions, but also of factors including 
existing telecommunications infrastructure, dwelling 
distribution and residential and business customers mix. 


The main services to be supported by FSAN-type systems 
according to [1] are shown in Table 1. 


Traffic characteristics 


Digital broadcast services, | Asymmetric broadband: 
VoD, Internet, distance | 2 - 25 Mbit/s down, 
learning, telemedicine, etc. | 64 kbit/s - 2 Mbit/s up 


Telecommunication Symmetric broadband 2 
services for small business | Mbit/s to at least 
customers, teleconsulting, | 4 x 2 Mbit/s. 
etc. 
POTS and ISDN 


Access network must be 
able to provide in a 
flexible way the narrow- 
band telephone services 


Table 1 Service Sets 


IV. FSAN SYSTEM LAB TRIAL DEMONSTRATION 


A. Lab Trial Objectives 

The primary objectives of the FTTH/O lab trial (which is 

part of the BOBAN project), aligned with the objectives of 

the BOBAN project as agreed amongst partners, may be 
viewed as being the following: 

e Identify and purchase a FSAN G.983-like FTTH/O 
system. 

e Build-on a lab trial with connection to service platforms 
(IP services, Real-Time ATM video etc). 

e Evaluate the FSAN like system. 

e Evaluate the provisioning of multiple broadband 
services to the user. 

e Analyse and assess results of the experimentation 
including interworking, service provisioning, FSAN 
Standard compatibility, system maturity and 
management features offered. 


B. Lab Trial Set Up 


Shown in figure 4, is the system setup for the Lab trial. The 
configuration is a Fiber to the Home/Office architecture. 
Available are three ONUs with ATMF25.6 interfaces for 
connection to PC or Set Top Box. The transmission 
between the OLT and the ONUs is based on ATM over a 
passive optical network. The passive optical network 
consists of a passive splitter connecting the ONUs with 
single mode fiber, in accordance with G.652. The PON 
interface is G.983.1 compliant with a symmetrical capacity 
of 155 Mbit/s. PON protection to the first split is possible. 
The interface of the OLT to the service node is STM-1 with 
1+1 protection. The OLT offers ATM cell multiplexing 
and demultiplexing and supports Permanent Virtual Paths 
(PVPs) and Permanent Virtual Circuits (PVCs). The 
classes of Quality of Service (QoS) supported are Constant 
Bit Rate (CBR), Unspecified Bit Rate (UBR). 


The OLT is connected to the various services over the ATM 
and IP network. 


ONU interfaces 


OLT Interface ATM25 
ATM 155 IF ATM155 


Video server 


Web server 
ssid 


Management 


Figure 4 Laboratory Trial Test Setup for FTTH 


The available service set includes: 
1. Streaming Video 
-MPEG-1 
-MPEG-2 
Both MPEG/IP/ATM and MPEG/ATM are possible 
Both display on PC and STB+TV are possible 
2. Fast Internet 
3. IP-Telephony 


C. Lab Trial Tests 


The objectives of the Lab trial have been the following: 

e To examine the FTTH/O system’s PON performance - 
Physical layer. 

e To examine the FITH/O system’s end-to-end ATM 
over PON performance, i.e. at the interface to the 
Service Network and User Network - ATM Layer 

e To examine service delivery using the ATMF25.6 UNI 
interface and possibly investigate the effects of the 
Customer Premises Network (CPN). 

e To test end-to-end performance parameters like 
bandwidth, access transmission delay, response time and 
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cell loss/discard ratio and associate such values with 
service types. 

e To test overall Quality of Service (eg. Video 
Broadcasting) offered at both the ATM level and as seen 
by the user (user perception of quality of service). 

e To examine deployment issues like the complexity of 
installation and interfacing, the means of planning and 
deploying the hardware resources, the complexity of the 
management system and the richness in management 
features. 


1) Physical Layer Tests 
ODN testing should demonstrate optical wavelength 
transparency and consider evaluation of the effects of 
splitters. Devices, such as optical branching devices 
(splitters) in the ODN, which are not intended to perform 
any wavelength-selective function, must be able to support 
transmission of signals at any wavelength in the 1310 nm 
and 1550 nm regions. Other issues under investigation 
regarding the splitter in the ODN are the effect on 
performance of: 
- the splitter positioning along the PON link (ie. close or 

distant to OLT) and 

- reflections caused by the unterminated splitter ends. 
Associated with the physical medium-dependent layer 
requirements, the following must be tested in accordance 
with the specifications of ITU-T G.983: 
- digital signal nominal bit rate 
- bit rate accuracy 
- operating wavelength 
- transmitter at reference point S 
- launched optical power without input to the transmitter 
- minimum receiver sensitivity at reference point R 
- minimum receiver overload (OLT/ONU) 

2) ATM Layer Tests 
The testing procedure at the ATM level involves both in- 
service and out-of-service mode tests [6]. In the in-service 
mode, use is made of the performance monitoring 
functionality of the FTTH system itself based on the 
analysis of the contents of OAM cells. In the out-of-service 
mode, after having set up a connection dedicated to the 
measurement of cell transfer performance, appropriate test 
cell sequences are transmitted on this connection and are 
analysed at the receive side. 


The tests involve setting up WPC/VCC connections in the 
FTTH system under test and specifying traffic parameters 
and quality of service (QoS) associated with the required 
class of service on these connections. The profiles for the 
test traffic are also set and background traffic, used to stress 
the system under test, is generated. The tests result in the 
display of ATM network performance parameters as 
follows [7]: 


1. Enrror-related network performance parameters 
— Cell Error Ratio: 

— Cell Loss Ratio; 

— Severely Errored Cell Block Ratio: 

— Cell Misinsertion Rate. 
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2. Availability-related network performance parameters 
— Availability Ratio; 

— Mean Time Between Outages. 

3. Delay-related performance parameters 

— Mean Cell Transfer Delay; 

— Cell Delay Variation (1 point and 2 points). 


Other fundamental tests for evaluating an FSAN-type 

system include: 

1. Robustness of guaranteed QoS - Effects of PON 

network (including customer premises live network) on 

QoS classes, traffic shaping and congestion control 

mechanisms. 

System Architecture related parameters — Evaluation of 

parameters such as, the maximum upstream and 

downstream throughput and maximum sustainable 
burst size at the interfaces. Also, simultaneous active 
connection capacity tests for the number of point-point 
setups, point-to-multipoint setups, etc, and latency tests 
for delay in connection setup, connect, and release. 

Policing Functions - Effects of PON network 

(including customer premises live network) on policing 

functions. Measurement of Cell Discard and Received 

Cell rate Verification with non conformant traffic. 

4. Test Ranging procedure - A full digital in-band based 
ranging method should be used by the PON system to 
measure the logical reach distances between each ONU 
and the OLT. The maximum range of the PON is at 
least 20 km. The transmission delay measurement for 
each ONU should be capable of being performed whilst 
the PON is in-service without disrupting service to 
other ONUs. The effectiveness of the Ranging 
protocol used is tested by introducing variable delay 
and attenuation in the links. 


io) 


tad 


V. CONCLUDING REMARKS 


In this article, a description of FSAN type architectures and 
major characteristics has been given with emphasis on 
APON FTTH. The major issues of an FSAN type system 
are the ATM PON interface between the OLT and ONU, 
the UNI interface which supports Ethemet, ATM and 
Circuit Emulation, the SNI interface which is ATM with 
signaling protocols like VB5.x for narrowband and 
broadband integration and support of SDVB zapping 
protocols. Having identified the FSAN type possibilities, 
services have been also identified. From the results of the 
studies, an FSAN FTTH lab trial has been setup for 
broadband service provisioning demonstration. The major 
issues for testing such an access network platform, based on 
both end-to-end service quality with integration to other 
service provisioning platforms, as well as physical and 
ATM layer performance testing have been identified. 


The results of this lab trial will be included in the BOBAN 
Workshop which is to be held in Turin, Italy, on the 11 and 


12" July 2000, together with the results of the rest of the 
BOBAN project. 
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Abstract 

There has been an international effort from 
standardization bodies to create a common model to 
govern service management in a communications 
environment. Various schemes have emerged from 
the standards. The most influential ones are TMN 
(Telecommunications Management Network) and 
FSAN (Full Service Access Network) - which partly 
cover the service management issue. This paper 
addresses the issue of service management 
specifically for the provision of broadband services. 
The requirements, roles and processes involved in 
service provisioning using examples of broadband 
access networks are addressed as well as the issue of 
future automation in the service management 
systems that will greatly assist the process. This 
work was done within the framework of the 
EURESCOM project BOBAN. 


Introduction 

Evolution in telecommunications and deregulation have 
created the need to offer a wide range of services [1]. 
Old, vertical management architectures which 
contained all of the processes, systems and tools needed 
to manage single service, are no longer cost effective in 
this environment. The support of multiple existing and 
as yet unknown services on a single platform naturally 
leads to the need for an integrated approach to network 
and service management. However the integration of all 
of the network management functions itself leads to a 
very complex undertaking. 


The basic management requirement of the FSAN [1] 
and TMN [2] is to manage a range of services from a 
common platform. Both TMN and FSAN approach the 
subject from the angle of the necessary network 
architecture and elements for achieving integrated 
service provisioning. TeleManagement Forum [3] [4] 
describes in depth service provisioning processes for 
each player in the service provisioning chain and the 
DSL Forum gives specifically the service provisioning 
process for ADSL access. 


This paper uses the case of broadband service 
provisioning as studied in the EURESCOM project 
BOBAN to suggest a layered approach for a service 
management platform and describe the parameters 
required to be set between the interfaces. The ADSL 
service delivery is used as an example for explaining 
the concepts dealt with in this paper. 


Service provisioning models 

Management of service refers to a set of capabilities that 
allow the exchange of management information to 
assist administrations in conducting efficiently 


provision of service. 


A process can be viewed as a series of tasks that describe the 
operations, functions and the relationship between the 
functions. Processes also identify the instances of data that 
are manipulated by the operations functions. A set of roles is 
also defined in the process of service delivery that reflect the 
responsibilities in each part of the service. 


Separation of roles reflects "Open Network Provisioning" [6] 

[7] directives. The aim of ONP is to separate the roles 

(business activities) in a liberalised environment as a way to 

enhance competition. The roles can be aggregated in the 

following way: 

e User. The interest of the User is to have end-to-end 
connections, whether this means inter-connection of 
business domains or access to content. 

e Service Provider (SP) makes contracts about delivery of 
services. Service Providers often use non-technical names 
to describe their services to Users. Service Providers 
compose their end-user services from network services 
sold by network providers. 

e Network Providers (NP) may use different technologies to 
materialise connections/access to networks. Network 
Providers support a network technology independent 
provisioning (ordering) interface towards Service 
Providers, equipment specific interfaces towards the 
network and are assumed to have an information storage, 
which contains an abstract picture of the network. 


There are several aspects which have an impact on the 
choice of the most cost effective way to do service 
provisioning. The most important aspects are the following: 

e Pre-provisioning and real time provisioning. Pre- 
provisioning means the network creation: installation of 
physical resources (e.g. new DSL access multiplexers) and 
provisioning of logical resources (e.g. virtual circuits). 
Pre-provisioning may suit all services or there may be 
cases, where the pre-provisioned resources must be re- 
provisioned for particular services. Real time provisioning 
means the provisioning of physical and logical resources 
needed to deliver the service to the final customer. 

e Network size. In smaller networks, the relative cost of 
provisioning software tends to be higher. On the other 
hand, pre-provisioning is often based on _ over- 
dimensioning, which is more costly in larger networks. 
Therefore pre-provisioning is often more cost effective in 
small networks than in large networks. 

e Life cycle phase of ADSL-service. In early stages of the 
life cycle, the ability for provisioning of basic services 
quickly and for a reasonable price is important. Later in 
the life cycle, when competition is intensified, ability to 
differentiate and provide quality assurance becomes more 
important. 
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Obviously, a single provision model cannot fulfil all 
different requirements. Therefore three provisioning 
models will be studied: 

e Minimum real time provisioning. The goal of this 
model is to encourage the use of similar CPE 
(Customer Premises Equipment) technologies and 
over-dimension the ATM network so that pre- 
provisioning can be used in most parts cf the 
network. 

e . Network resource optimization. This model aims to 
allocate exactly the needed network resources to 
connections. 

e Rich management. This model adds _ different 
proposed management-based services to the network 
resource optimization model. The model is handled 
on more abstract level and covers most of the 
provisioning process. Management of these services 
will be added, when they are defined. 

Provisioning may use a combination of different models 

or may use different management solutions. 


Reference Model Architecture 

Figure 1 shows an architecture that represents the basic 
model for service provisioning. The architecture uses 
the example from ADSL deployment which can be 
extended to other broadband services. The figure shows 
an integrated end-to-end service management system 
with the appropriate parameters that have to be defined 
between the network elements. Hence the Virtual Path 
Identifier (VPi) and Virtual Circuit Identifier (VCi) are 
defined as (VPi/VCi)user aS those identifiers assigned to 
the user. It is important to note the option of CD ROM 
provision for installation by the user. A costly 
alternative would be a technician to visit the customer 
premises. 


Pre and real Time Provisioned Parameters and SW 
This section describes parameters, which have to be 
used to different equipment. 


PC (Customer Premises network) 

The parameters and SW to be provisioned are listed in 
two cases: PPP over ATM protocol stack (e.g. in case of 
USB modem) and PPP over Ethernet stack. 

The User owns the Customer Premises equipment and 
is ultimately responsible for it. However, since the User 
buys services from both SPs and NPs, there is a strong 
expectation that they provide SW, which is easy to 
install and does not mix other settings. The SP(s) and 
the NP (if chosen so) must provide the user with 
authentication information. 


For USB-modems, the needed components are: 
e PPP/ATM stack (exists e.g. in Windows) 
e USB modem driver (delivered with the modem) 
e Dial-up software (existing Windows application) 


For Ethernet modems the following components are 
needed: 

e PPP Ethernet stack (may need installing) 

e Ethernet card (if not installed already) 


e Ethernet modem driver 
e Dial-in software suitable for PPP over Ethernet protocol 
stack 


On [P-level, IP address may be defined or obtained with 
DHCP. The choice of method and potential DHCP server 
depend on different factors. The way equipment have to be 
accessible and the easiness of set-up are the main factors for 
the choice of IP address allocation. If the equipment must be 
reachable from the Internet (like a server), it (or the NT) must 
be allocated a public, permanent address. If the equipment 
must be reachable from inside a service domain, like a private 
network, that network should provide the IP address. Some 
applications communicate the IP-address of the equipment to 
other applications that will use the address to send traffic. 
Addressing from access network would allow all connections 
to communicate. At present, NAT in customer premises 
network would prevent this but future protocols may solve the 
problem. If there are several customer premises equipment, 
which always should have the same IP address, different log- 
in names may be a solution. Modem may be owned by the NP 
or by the User. The NP is responsible to configure its own 
modems and to give advice for the configuration of other 
modems. If the User acquires a new modem with a different 
stack, the user must order corresponding modifications to 
routing from the SP or NP. 


NT/ATU-R 

(VPi/VCi)user Must be configured on the ATU-R. This is the 
responsibility of the NP though the vendor may pre-configure 
the equipment. 


DSLAM 

The following must be provisioned if pre-provisioned values 

cannot be used (the NP is responsible for the DSLAM): 

e Cross connect the network port with the ADSL position of 
the customer line card with the right service profile 
number. (Service profile means ADSL level parameters 
(e.g. bitrate, latency, noise margin) and ATM level 
parameters (e.g. QoS, MCR, PCR). 

e Configure the (VPi/VCi),- between DSLAM and ATM 
Node. 


ATM regional Network 

A pre-configured path through the ATM network may be 

used or all connections may be provided individually. In the 

latter case, the data needed to perform the cross-connection 
are: 

e Nodenamel, slotl, portl, VPi, VCi, plus a number of 
parameters depending from the node type and the 
software version. 

e Nodename?2, slot2, port2, VP, VC, plus a number of 
parameters depending from the node type and the 
software version. 

The configuration is done by the NP. In the network resource 

optimization approach, the parameters are based on an order 

from the SP. The order reflects the contract between the SP 
and the User and connection parameters must be updated 
whenever the contract is updated. 
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BAS (Broadband Access Server) 
In principle, the BAS may belong to the Access 
Network Provider or to the SP. 


If BAS belongs to the SP: 

e A path between the ATM network of the Telco and 
the SP BAS has to be pre-configured 

e The service provider must take care of stacks and 
modem compatibility, see the following 
requirements. 


If BAS belongs to the access network provider: 

e Router and ATM sub-interface must be pre- 
configured. 

e Termination of VPI/VCI from the customer on the 
right NRP with the right VC-class (PPP over ATM 
or PPP over Ethernet protocol stack) 

e If the NP, itself, identifies all Users, sufficient 
authentication information must also be provided. 
The provisioning of the above listed capabilities and 
interoperability with the SP security system are 

responsibilities of the NP. 


Parameters to be provisioned 
Table 1, contains the parameters and network resources 


necessary for minimized real time provisioning and 

network resource optimization provisioning. 

Minimized real time provisioning is based on the 

following assumptions: 

e Support for Ethernet modems is pre-provisioned 
(although this may change with the market). 

e ADSL and ATM networks are dimensioned for high 
throughput. In ATM, overbooking is used. UBR is 
used. Quality is achieved by keeping the maximum 
network load in 60 to 70%. 

e All users are authenticated to prevent misuse. 

e Service Provider or BAS takes care of bandwidth 
management and user profile. 


Automation in service Provisioning 
Automation is done in order to make provisioning more 


efficient and hence increase the company profit. The 
main areas, where these benefits may be gained are 
simplified work processes, synchronization of different 
actions and empowerment of user to manage features. 


Provisioning is a target for automation because the cost 
of automation of simple repetitive activities is relatively 
low. Automation of provisioning may however bring 
considerable benefits in the form of fast and cheap 
processes and new competitive services, which empower 
users to take full benefit of their connections. 


Benefits from Automation 

In the present phase of ADSL _ introduction, 
provisioning of most lines requires manual work in 
exchanges. Automated ADSL provisioning allows work 
force management to fine-tune the synchronization of 
customer availability and work force movement. Also, 


automation allows customers to manage features of their 
existing connections within their pre-agreed profiles. This 
can most easily be achieved in cases, where cooperation of 
service and network management has already been automated 
for better service like in many Telcos. Therefore, automation 
provides an extra competitive advantage to Telcos. Especially 
NPs, which have chosen network resource optimization may 
gain benefits. 


In the early phase, successful provisioning process would look 
like the one in figure 2 where the automatic request from user 
would activate a series of functions where reservation, 
configuration and activation would happen automatically. In 
future the customer will have the power to control the 
provisioning process. 

This tendency wiil depend on many factors like: 

e Density of users, which are willing to pay for faster 
connections. 

e Cost of complete pre- provisioning or efficient prediction 
compared to individual provisioning. Cost of equipment is 
here an important factor. 

e Other earning models. How much can Telcos earn from 
broadband data users. 

e When will e-commerce be mature enough to be used for 
making contracts. 

e Competition. 


Automation enables the point of control to be moved closer to 
the user and/or the design of network products, which are 
based on very fast fulfillment. Ultimately this would enable 
the user to order services and have them delivered 
immediately. The customer controlled provisioning process 
would look in its simplified form like the one in figure 3. 


The benefits of these scenarios are: 

e Faster service, e-commerce trains people to expect 
immediate delivery. 

e More accurate service (no "human" errors by computers) 

e More efficient work flows. Most initial benefits can be 
expected from service provider to workforce management. 
Later, 24h/7d management service empowers users. 


Since the development of information systems is more time 
consuming than adaptation of new technologies, gradual 
preparations for automation have already started. Both 
service provider processes and resource reservation are not 
standardized and vary very much among companies and are 
at least automated. 


The automation of activation/deactivation is a potential target 
for new automation systems because of: 


e Variation in features, which are supported by different. 


vendors of broadband access network technologies, do not 
differ so much. 

e Management processes of these features at different 
operators are very similar. Work orders are sent to 
management centers, where work is done manually. 

e Interfaces of the vendors for the same system are the same 
or very similar at most operators. 

e Lack automated, “off the shelf", multi-vendor ADSL 
activation/deactivation products on the market 
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Unified Interface 
The concept of a unified interface is applied in the case 
of communication between service management and 
network management. It is a concept that ideally will 
enable integration of network management systems 
from many vendors in order to make the automated 
service provisioning feasible. More precisely, it is a 
reference point depicting the communication of 
functionality between the management layers. The 
unified interface combines features from different 
vendors, which the operator wishes to sell. The basic 
attributes of a unified interface can be identified as: 

e Identification of resource reservation. Resource 
reservation will provide the correct port 
identification. 

e Bandwidth. The maximum transport capacity which 
the user may have. 

e Error Correction Coding / Latency. Coding impacts 
error performance, latency and net payload rate, 
which the user may have. Service providers will be 
able to indicate the coding but future empowered 
end users will need guidance. 

e Selection of ISP or core network or private network. 
Optional field. Information about options in the 
contract/reservation. 

e Support of QoS/CoS. 


Conclusions 

This paper presented the requirements, roles and 
processes involved in service provisioning for 
broadband access networks. It highlights the importance 
of automated service provisioning as a means of 
increasing efficiency and company’s profits and 
describes the notion of a unified interface that will 
enable integration of network management systems 
from many vendors in order to make the automated 
service provisioning feasible 
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Figure 2: Provisioning process work flow. User is the 
person serving the end customer. 


Figure 3: Customer controlled provisioning process. 
User is the end customer. 
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Analysis of Energy Dissipation in the Memory 
Hierarchy of Embedded Systems: A Case Study 
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Abstract 
This paper presents an experimental study of the energy 
dissipation in the memory hierarchy of an embedded system 
during the execution of a specific application. 
Because of its ubiquity in many applications in different 
fields, we have chosen the Discrete Cosine Transform 
(DCT) as a benchmark for our study. The experiments 
consist of evaluating the energy dissipation of the memory 
hierarchy of the system, by comparing different software 
implementations of the DCT. An instruction-level simulator 
and a cache simulator have been used to perform energy 
estimation. 
The experiments show that the fastest (i.e., more hardware- 
or performance-efficient) implementation is not always the 
most beneficial from the memory energy stand-point. 


Index Terms 
Low-Power Design, Memory Energy Optimization; Memory 
Models; Discrete Cosine Transform. 


I. INTRODUCTION 


Memory hierarchy design offers additional opportunities 
for the optimization of the energy required by embedded 
systems. In such systems, very little freedom is given to the 
designer in choosing the core processor. Conversely, 
communication and, especially, storage resources are more 
application-specific; thus, they can be customized with more 
flexibility. Memory size is tailored on application 
requirements (e€.g., program size, communication buffer 
size). Similarly, memory organization (e.g., caching, 
interleaving) is often driven by performance, power and cost 
(area) constraints. Communication resources, such as buses, 
connect processors to application-specific memories and 
peripherals. Consequently, they are inherently application- 
specific as well. 

Several works have addressed these issues in the recent 
past. One category has addressed the problem of optimizing 
the memory hierarchy [1,2,3,4]. These approaches specify a 
core processor and an application mix, and explore the 
memory hierarchy design space to find the organization that 
best matches processor and application. 
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Usually, the design space is parameterized and discretized 
to allow exhaustive or near-exhaustive search. The best 
memory organization is obtained by simulating the 
workload for all possible alternative architectures. 

Another approach is concerned with memory allocation. 
These techniques assume that memory access patterns can 
be transformed to optimally exploit memory hierarchy. 
Program transformation techniques have been proposed 
targeting instruction and data memory. Techniques for 
instruction memory optimization change the location of 
basic blocks (possibly by inserting jumps) in such a way 
that the most frequently executed blocks fit into a small 
instruction buffer [5,6,7]. Data memory optimization 
techniques [8] have been developed for specific data types 
(e.g., fixed-size arrays) and control structures that have 
regular and predictable access patterns. In this case, 
memory accesses can be re-ordered to improve cache 
locality or to optimally exploit small and power-efficient 
data buffers. 

In this paper, we analyze the joint effect of memory 
hierarchy structure and memory access patterns on energy 
consumption. We focus on embedded systems, where a 
given core processor repeatedly runs an embedded 
application, and where power or energy constraints are 
usually more stringent than for general-purpose systems. 

In these systems, typical embedded applications realize 
common functions (such as digital filtering or signal 
transform) for which highly optimized implementations 
exist, and whose re-use is strongly enforced. However, the 
optimization criterion used to write these programs is 
usually performance. Although performance is apparently 
not in contrast with energy (a faster algorithm implies fewer 
operations and thus fewer memory accesses), the memory 
access patterns are also important and can sometimes be the 
dominating factor for energy consumption in the memory 
hierarchy. 

In order to study these effects, we have selected the Discrete 
Cosine Transform (DCT) as the specific application, and we 
have analyzed the impact of different implementations on 
the energy consumption of the memory hierarchy for 
various cache configurations. 

The results demonstrate that the fastest implementation is 
not always the most energy efficient. These results further 
confirm the claim that writing energy-efficient software is 
an important paradigm, and that energy savings, in core- 
based systems, cannot be obtained as a by-product of 
performance-driven software. 
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II. THE DCT ALGORITHM 


The DCT has become an ubiquitous computation in several 
applications and standards (JPEG, MPEG, HDTV), thanks 
to its relatively low implementation complexity and _ its 
similarity to statistically optimal bounds [9]. 

The DCT is a linear transform that converts a signal from 
the spatial to the frequency domain. The basic transform 
maps a vector X of N input data points onto a vector Y of N 
output data points using a transform matrix C of size NxN. 
The transform vector X is obtained as: 


Ym) = 2/N'? km Qim=o.N-1 cos((m(n+1)m)/N) (1) 


for n,m = 0,...,N-]. Coefficients k; are kg= 1/V2, and k;=1 for 
all other 7. 
The formula of Equation | identifies the one-dimensional 


(1-D) DCT. Practical applications of the DCT such as still | 


or moving image processing (e.g., JPEG or MPEG) use 
higher-order (2-D or 3-D) transforms, that are derived from 
Equation | by extending the summations over the additional 
dimensions. However, the mathematical properties of the 
DCT show that this is a separable transform, that is, multi- 
dimensional transforms can be obtained by separating the 
summations in each dimension of the space. In other terms, 
a multi-dimensional DCT can be computed as a series of 1- 
D DCT, which can then be considered as the basic building 
block of any DCT-based application. Therefore, we will 
also focus on 1-D DCT as our target application. 


2.1 DCT Algorithms 


In spite of its apparent simplicity, direct implementation of 
the DCT is not very efficient from the computational stand- 
point, since it requires O(N’) multiplications and as many 
additions. Therefore, as for more conventional discrete 
transforms (e.g., DFT) there exist several fast DCT (FDCT) 
algorithms. Speed has top priority since the DCT is usually 
invoked very frequently in real-life applications, typically 
under stringent throughput constraints (e.g., in the MPEG 
standard, processing a frame requires 324000 2-D 
transforms per second). 

As a consequence, regardless of the actual target of the 
implementation (in hardware or in software), a significant 
research effort has been spent in the last years in order to 
develop fast DCT algorithms. A comprehensive review of 
these algorithms is proposed in [9]. 

Most of these “fast” algorithms target the reduction of the 
overall number of operations (multiplications and 
additions). This metric is in accordance with the constraints 
of both software and hardware implementations. In the 
former case, fewer operations directly translate into faster 
execution times; in the latter case, fewer hardware operators 
allow faster cycles and reduced latency. 


Unfortunately, when energy constraints come into play, the 
effects of speed optimization become less predictable. This 
is especially true for software implementations. In fact, 
although shorter execution times also typically translate into 
lower energy consumption, the memory access patterns can 
adversely affect this intuitive relation. This aspect cannot be 
neglected, since it has been reported that usually more than 
50% of the energy dissipation of a system is due to the 
memory hierarchy sub-system. 

This effect is further amplified by the possible presence of a 
cache in the system, because cache energy is extremely 
sensitive to the memory access patterns. Unlike main 
memory, the content of a cache evolves dynamically; this 
continuous update of the working set, that also depends on 
the structural and functional parameters of the cache (size, 
associativity) has a dramatic impact on the overall system 
energy consumption. 

In the case of an embedded system, where the core 
processor executes the specified application continuously, it 
is possible to profile such memory access patterns, and 
precisely trace the evolution of the cache contents. This 
allows an accurate evaluation of the relation between code 
optimization and cache energy consumption. 

The above considerations lead us to the analysis reported in 
this paper. In other words, given the possibility of 
accurately profiling memory accesses, and a cache energy 
model, we tried to answer the question whether a faster 
algorithm is always also less energy consuming. 


2.2 Alternative DCT Algorithms 


Among the several options available, we have chosen three 
alternative implementations of the DCT. The reference 
implementation (called DCTa henceforth) corresponds to 
the direct translation of the summation of Equation |}. 

The other three algorithms are taken from [9]. The choice 
has been dictated by their “structure”. For example, many 
fast DCT algorithms rely on some degree of unrolling of the 
main DCT summation. Algorithms in this class are 
interesting candidates for our analysis, because the 
reduction of the number of operations is achieved by 
breaking the regular access pattern of the loop 
corresponding to the summation. 

Other algorithms are based on a recursive paradigm, where 
a N-point DCT is expressed in terms of DCTs of smaller 
size. Again, these algorithms only partially keep the 
regularity of the accesses of the direct implementation. 

The first algorithm (DCTb) is an implementation of the 
method of [10]; it belongs to the class of the recursive 
algorithms, although its implementation is non-recursive. 
The second algorithm (DCTc) is a minor variant of the first; 
instead of computing the table of cosines required by the 
DCT on-the-fly, DCTc uses a pre-computed cosine table. 
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Finally, the third algorithm (DCTd) is an implementation of 
the method of [11], and belongs to the class of partially 
unrolled algorithms. 

For large values of N, algorithm DCTc is the fastest, 
followed by DCTb. DCTd performs slightly worse than 
DCTb and DCTc. The reference algorithm is extremely 
slower than the others (up to two orders of magnitude). 
However, in typical applications (¢.g., JPEG) the DCT is 
repeatedly applied to relatively small blocks (e.g., N=8), 
rather than to very large blocks. In these cases, the 
performance differences smooth out, although the ranking 
between the four implementations remains the same. 


Ij]. EXPERIMENTAL RESULTS 


3./ Experimental Setup 


The experiments have been carried out under the 
assumption of a system consisting of a core processor, an 
on-chip cache, and an external main memory. For the 
evaluation of the energy consumption in the memory 
hierarchy, we used the flow depicted in Figure 1. 


ARMulator 


R/W accesses 


DinerolV 


Hit/Miss 


Memory 
Model 
y nergy 


Figure 1. Experimental Flow. 


The entry point is a C program, which is fed to 
ARMulator, an instruction-level simulator for the ARM 
processor family. In particular, we have used an ARM7 
processor. The memory access trace is then filtered to 
extract data address information. Evaluation of cache 
statistics requires a cache simulator, in order to account for 
the dynamic effects caused by the cache replacement 
mechanism. The filter thus generates data addresses in one 
of the formats supported by the Dinero cache simulator 
that we have used to evaluate cache statistics. The resulting 
data are plugged into a cache model derived from that of 
[2], to get the corresponding energy consumption. Notice 
that the energy consumption refers to the whole memory 
hierarchy, because our model properly takes into account, 
according to the hit/miss profile, which accesses are in the 
on-chip cache and which are not. 


3.2 Experimental Data 


We have used the flow of Figure 1 with the four DCT 
implementations described in Section 2.2. All the four 
programs are used to compute the DCT for N=8 (as in 

JPEG), and have been fed with the same input data 

representing an image consisting of 6400 pixels. 

We used several cache configurations to analyze how the 

energy behavior changes over the cache space. The cache 

parameters used in the exploration are: 

e Total cache size in bytes, D; 

e Size of the cache line, L, in bytes; this value denotes 
how many memory words are actually contained into a 
cache entry (a set); 

e Associativity, a; this number determines the number of 
parallel memory arrays contained in the cache. 

These three quantities are not independent, and are bound 

by the formula D = L- a - S - w, where S denotes. the number 

of sets (lines) in the cache, and w the size of a memory 
word in bytes. In order to partially limit the exploration, we 
have constrained some of the quantities above. First, since 
we adopted a ARM-based flow, the size of the memory 
word is fixed to 32-bit (i.e., w=4). This also constrains the 
minimum value of L, since it must at least be of the same 

size of a memory word, i.e., L 24. 

Concerning cache size, we have assumed relatively small 

sizes, given the architectural assumptions of this work. 

Unlike general-purpose processor architectures, where 

caches may reach sizes in the order of the Mbytes, in 

embedded systems-on-a-chip the limited area budget 
constrains these sizes in the order of the Kbytes. 

Consequently, we have limited our analysis to three cache 

sizes: 1KB, 2KB, and 4KB. 

These constraints still allow for a very large set of cache 

configurations. We thus added a further limitation on the 

minimum number of sets in a cache, namely S > 256. This is 
dictated by the need of having a “well-shaped” cache. 

Putting a lower bound on the number of sets prevents the 

generation of unrealistically very “wide” caches. We ended 

up analyzing ten cache configurations, summarized in the 
table below. 


Cache 
at ig 
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Figures 2, 3, and 4 pictorially show the memory energy 
results. The figures are referred to 1KB, 2KB, and 4KB, 
respectively. From the analysis of the plots, we observe that 
the energy ranking of the algorithms is different from that 
of performance. In particular, we notice that algorithms b 
and c, the most efficient performance-wise, are also the 
most energy demanding, even more than reference 
algorithm a, whose regular access patterns make it quite 
low-energy. 

It is also interesting noticing how the improvement of 
algorithm c with respect to b (off-line vs. on-line cosine 
tabulation) actually ends up in an energy increase. This can 
be justified by the fact that off-line computation requires 
allocation of an additional array, whose references are more 
expensive than the repeated invocation of a library function 
directly. 

The energy ranking is (d<a<b<c). Interestingly, this 
ranking preserves, though with some differences in the 
relative energy figures, across all the cache configurations 
we have explored. 


DCT-a DCT-b DCT-c DCT-d 


Figure 2. Energy Results for a IKB Cache. 


DCT-a DCT-b DCT-c OCT-d 


Figure 3. Energy Results for a 2KB Cache. 


L=4, a=1 
BL=4, a=2 
OL=4, a=2 


q 


DCT-a DCT-b DCT-c DCT-d 


Figure 4. Energy Results for a 4KB Cache. 


IV. CONCLUSIONS 


We have analyzed the memory hierarchy energy 
consumption of different implementations of a given 
application. Such program is meant to be executed on an 
core processor in an embedded system. Experimental 
analysis has shown that faster implementations are not 
typically the more energy efficient, because their memory 
access pattern regularity can be partially destroyed. 
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Abstract—Achieving long battery life-time is the main goal in 
the design of several classes of portable applications. The 
capability of monitoring the impact of different design options 
on the discharge rate of the battery is thus key to guarantee the 
quality of the chosen optimizations. In this paper, we 
demonstrate experimentally how a battery model suitable for 
high-level digital simulation can be of help for studying the 
variations of the life-time of a battery when different loading 
conditions are assumed. In particular, the impact of the profile 
of the load current drawn from the battery on life-time is 
investigated in detail for the realistic case of Lithium-Ion 


DP /batteries. 


PAT TG 


Index terms—Digital systems, circuit modeling 


I. INTRODUCTION 


Since for portable applications the most critical issue is 
battery life-time, rather than power dissipation, optimization 
of digital circuits being used in such systems demands a 
careful understanding of the battery behavior; furthermore, 
accurate, simulatable models for battery and DC-DC 
conversion circuitry are required to properly tune the 
various optimization steps being taken during system 
development. 


In [1], we have presented a modeling approach based on the 
idea of constructing a high-level VHDL circuit that 
accurately emulates the behavior of a Lithium-Ion battery 
and that can be used in RT, behavioral or instruction-level 
power simulation. Accuracy is achieved through SPICE- 
level training of the model. In [2], the model has been 
extended to other types of battery chemistries, namely, 
Nickel-Cadmium, Alkaline and Lead-Acid. 


In this paper, we show how the discrete-time model of [1] 
can be successfully used to collect information about battery 
behavior that may be of great interest to designers of digital 
electronic applications. 


Luca Benini is with Universita di Bologna, Dip. di Elettronica, 
Informatica e Sistemistica, Bologna, Italy 40136. Giuliano Castelli, 
Alberto Macii, Enrico Macii, Massimo Poncino and Riccardo Scarsi are 
with Politecnico di Torino, Dip. di Automatica e Informatica, Torino, Italy 


For the experiments we consider Lithium-Ion batteries, 
since they have a dominant position as the chemistry of 
choice for notebook and laptop computers. We first present 
results concerning the life-time of the battery vs. different 
average values of current loads. Then, we discuss data 
regarding the dependency of the battery discharge rate on 
the frequency of the load current and the effect of the duty- 
cycle of the current load on the battery discharge rate. 


The remainder of this paper is organized as follows. In 
Section II we summarize the model we have adopted to 
carry out our investigation. Section III reports the results of 
the analysis. Finally, Section IV concludes the manuscript. 


II. BATTERY MODEL SUMMARY 


The high-level model of a Lithium-Ion battery we have 
introduced in [1] is derived from the circuit-level model 
initially proposed in [3][4]. 


Charge storage in a battery can be modeled as a capacitor 
with capacitance C = 3600 CAP, where CAP is the nominal 
capacity in Ahr, which is usually provided in the battery's 
data-sheet. By setting the initial voltage across the capacitor 
Vc = 1, we initialize the battery to its fully charged state. 
Unfortunately, the simple linear capacitor model is not 
accurate enough to model complex phenomena observed 
during battery discharge. In fact, the following three major 
effects must be taken into account: 


e Battery voltage has non-linear dependence on its state 
of charge: Voltage V;, decreases monotonically as the 
battery is discharged, but the rate of decrease is not 
constant. 

¢ The actual usable capacity of a battery cell depends on 
the discharge rate: At higher rates, the cell is less 
efficient at converting its chemically stored energy into 
available electrical energy. 

e The “frequency” of the discharge current affects the 
total amount of charge the battery can deliver: The 
battery does not react instantaneously to load changes, 
but it shows. considerable inertia, caused by the large 
ume constants that characterize electro-chemical 
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These effects can be modeled at the circuit level as shown in 
Figure |. 


Ve Most (Vrate) 


< {Vsense) ie fox sl & 


Figure |: Continuous-Time Battery Model. 


Dependency on the state of charge is taken into 
consideration by storing several points of the curve in a 
look-up table (LUT) addressed by the value of the state of 
charge (Vc). The model is accurate up to a minimum cut-off 
voltage, after which the battery is considered fully 
discharged. 


Dependency on discharge rate is modeled with a voltage 
source V,,,, in series with the charge storage capacitor. 
Voltage V),,, reduces the apparent charge of the battery 
(which controls the output voltage). The value of V),,, is a 
non-linear function of the discharge rate (which can be 
modeled by another LUT). 


Dependency on the discharge frequency is modeled by 
averaging the discharge rate used to control V/,,, through a 
low-pass filter (Ry,C)). 


Additional effects such as temperature and _ internal 
resistance are also taken into account in order to increase 
the accuracy of the model. 


According to [3][4], this model fits measured data fairly 
well (within 15%). This accuracy is acceptable, since the 
actual capacity of any group of cells may vary as much as 
20% between identical units, when we take into account 
manufacturing variances [4]. 


The discrete-time model of [1] is able to track the 
continuous-time one under various load conditions. In fact, 
the comparison of the estimates provided by the two models 
gives a 0.5% error for what concerns battery life-time 
analysis, and 0.7% when we consider battery output voltage 
results. 


The errors are mainly due to the intrinsic difference that 
does exist in the implementation of the two models. In the 
discrete-time model, the non-linear relationship between the 
two quantities is obtained by _ piece-wise linear 
approximation of values tabulated in an array. Conversely, 
in the continuous-time model, the interpolation of the 
tabulated values is obtained by imposing the continuity of 
the first derivative. 


III. EXPERIMENTAL ANALYSIS 


The availability of an accurate, discrete-time model of a 
battery allows us to investigate in details, yet quickly, how a 
battery behaves under different loading conditions. Before 
presenting the results of our analysis, we first briefly outline 
the key characteristics of Lithium-Ion batteries, since this is 
the chemistry we have assumed while carrying out our 
investigation. 


A. Lithium-Ion Batteries 


Lithium-Ion batteries have a dominant position as the 
chemistry of choice for notebook and laptop computers. The 
main reason for this fact is that Lithium-Ion batteries have 
the best volumetric energy density among all competing 
cells. In simple terms, Lithium-Ion batteries provide more 
energy than other cells for a given battery volume (or 
weight). On the other hand, Lithium-Ion batteries have 
higher cost than traditional secondary batteries like Nickel- 
Cadmium, because their fabrication technology is still quite 
expensive and not fully optimized. High performance and 
high cost have limited the market expansion of Lithium-Ion 
batteries to high-end products, where energy drain is high 
and high capacity is required to obtain acceptable life-time 
with reasonable weight. 


Safety of Lithium-Ion batteries is an additional issue. If 
Lithium-Ion cells are over-charged, they may be damaged, 
leak, or even explode. Hence, smart battery chargers are 
required, thereby further increasing ownership costs. 
Lithium-Ion battery producers are steadily improving their 
technologies, both by cell chemistry optimization and by 
embedding low-cost electronic controllers within the battery 
package, in order to reduce the external support needed for 
operating the battery safely. 


Currently, Lithium-Ion batteries are expanding their market 
dominance in the portable computer arena, and thanks to 
lower production costs, they are also extending their 
competitiveness to low-end products. 


B. Impact of Current Load Profile on Life-Time 


In Figure 2 we plot the life-time of the battery vs. different 
average values of current loads. The solid line in the figure 
refers to the ideal battery model, i.e., a linear capacitor 
model. The nominal capacity of the battery is 1.35Ahr. 


The other curves represent the response of the discrete-time 
model to five different load profiles. The profiles 
correspond, for a given average load current J, to five 
square waves with 50% duty-cycle and different “high” and 
*low” levels. The levels are symmetrical with respect to the 
average value and have a skew of 0 (i.e., the constant load), 
+0.2, +0.4, +0.6, and +0.8A. 
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The first, straight-forward conclusion that can be drawn by 
observing the plot is that the ideal, linear capacitor model 
sensibly overestimates battery life (e.g., for a constant 1.0A 
load, 6150s vs. 5100s, approximately a 20% overestimate). 


The purpose of this experiment is to emphasize that battery 
discharge does not only depend on the average current load, 
but also on the profile of the load [5]. More precisely, a 
smaller average current load can discharge the battery faster 
than a larger one, if the levels of the current are more 
skewed from the average value. This effect can be observed 
by intersecting the curves with a horizontal line. For 
example, if we take a life-time value of 4000s, we intersect 
the bottom curve (skew = +0.8A) at J = 1.17A, and the 
second curve from top (skew = +0.2A) at I = 1.24A. This 
implies that we can possibly get the same battery duration 
with a 6% (1.24A vs. 1.17A) higher average load. Similarly, 
by intersecting the curves with a vertical line (fixed J), we 
can quantify the potential life-time overestimation we can 
incur if we neglect the actual load current profiles. For 
example, if we intersect the line corresponding to J = 1.2A 
with the bottom and top curves of the plot, we have an 
overestimation range of 8.3% (from 3850s to 4170s). 


mela) 


Figure 2: Life-Time Sensitivity to Load Variations. 


C. Impact of Current Load Frequency on Life-Time 


Figures 3 and 4 show the dependency of the battery 
discharge rate on the frequency of the load current. Such 
current is assumed to be a square wave with 50% duty- 
cycle, with an average value of 1.0A and 1.6A, respectively. 


Each plot shows three curves, corresponding to three 
different periods of the square wave, namely 60, 300, and 
600s. Interestingly, the battery discharge rate is almost 
insensitive to the frequency of the current waveform. 
Differences in the output voltage for the three curves are in 
fact very small, although they tend to slightly increase for 
larger values of the average load current. 


Figure 4: Battery Life for Different Current Frequencies: 1.6A Avg. Load. 


D. Impact of Current Load Duty-Cycle on Life-Time 


The effect of the duty-cycle of the current load on the 
battery discharge rate is analyzed in Figures 5 and 6. The 
load current is assumed to be a square wave with a fixed 
average value of 1.0A (Figure 5) and 1.6A(Figure 6), and 
variable duty-cycle (10% for the solid line and 90% for the 
dashed line). The duty-cycle is defined as the ratio of the 
“high” value over the period. 


To guarantee the same average current for different duty- 
cycles, the “low” level of the square wave is fixed at 0.8A, 
whereas the “high” level obviously increases for decreasing 
duty-cycle values. For example, in Figure 5, the “high” 
current levels are 2.8A for a duty-cycle of 10% and 1.022A 
for a duty-cycle of 90%. This observation is important to 
give a correct interpretation to the results of Figures 5 and 
6. Although a smaller duty-cycle implies that the load is 
drawing a lower current for a larger fraction of time, it 
actually draws a higher peak value. 
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To some extent, we can claim that the effect of the peak 
load current dominates over that of the continuous load 
current. In other terms, shorter and higher pulses of load 
current tend to discharge the battery faster than longer and 
lower ones. 


This explains why, in both plots, the curve with a smaller 
duty-cycle discharges the battery quicker. This effect 
becomes more sizable for increasing average current loads, 
as shown in the plot on the right. For example, the output 
voltage reaches 2.5V after 2400s for a duty-cycle of 10% 
and after 2950s for a duty-cycle of 90%; this means a 23% 
life-time extension. 


Figure 6: Battery Life for Different Current Duty-Cycles: 1.6A Avg. Load. 


IV. CONCLUSIONS 


In this paper we have presented and discussed the results of 
an experimental analysis on the impact of various factors, 
such as average current value, current frequency and current 
duty-cycle on the life-time of Lithium-Ion batteries. The 
investigation has been possible thanks to the usage of an 
accurate, high-level battery model that has enabled the 
simulation of realistic current loads in very short time. 
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calculations have been made by using the Transmission-Line 
Modelling (TLM) method. The modelling process is described 
and the obtained numerical results, for the case of dielectric 
sample placed on the bottom of the cavity, are verified in 
fram reference [1]. Also, TLM method is applied to the more 
real case when the inhamogeneous dielectric sample is raised 
results are presented and discussed. 


Index terms— TLM. cavity, inhomogeneous load 


L INTRODUCTION 


Modellimg of flowmg type microwave applicaioss, 1¢. 
tunnel type applicators, tums out to be of a great practical 
importance. Movable applicator load (which moves along 2 
Ime) could be represented m the form of dielecinc slab 
load raised from the bottom of the cylindncal cavity and 
whose electrical parameters are continually changed along 
the moving direction dependmg on the temperature 


Variation in the particular load sample. Simplified case of 


inhomogencous lossless oad, placed on the bottom of the 
rectangular cavity (Fig.l, =O), was discussed m reference 
[1]. m order to calculate and investigate the effect of the 
dielectnc mbhomogeneity on the resonant frequencies. The 
resanant condition is obtamed by usmg the transverse 
resonance (TR) method [2.3.4] and solving of the integral 
form of an appropniate characteristic equation. 


However, this approach for the real case when load is 
raised above the cavity floor (rs), except in a symmetric 
case (7=s), faces up to the difficulties of numerical nature 


and cannoi give a satisfactory solution. The limitations of 


this approach can be overcome by using the Transmission- 
Lime Modellmg (TLM) method [5] TLM method is a 
general. clectromagnetically based, numerical method, 
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which was affirmed in the area of modelling of cylindncal 
metallic cavities, loaded by lossy dielectric samples of 
complex shape [6]. 


Figure 1. The rectangular metallic cavity loaded with an 
inhomogeneous dielectric sample: 
a=35 an, 5=37 am, 26.9 cm 


The aim of this paper is to represent the possibilities of 
TLM method for modelling of rectangular metallic cavity 
loaded by an mhomogeneous lossless dielectnc sample, 
which is raised at the arbitrary height above the cavity 
floor. In order to verify TLM numenical results of resonant 
frequencies for TEj9,; dommant mode, companng them 
with the results based on the TR method, two characteristic 
locations of mhomogeneous load are considered: on the 
bottom (0) and in the middle of the cavity (7s). Also, 
TLM method is applied to one real case of tunnel type 
microwave applicator when mhomogeneous load is located 
at the height -—4 cm from the bottom of the cavity. 


IL THE INTEGRAL TRANSVERSE RESONANCE METHOD 


An approach to the calculation of resonant frequencies m a 
microwave rectangular cavity loaded by an inhomogeneous 
lossless dielectric sample was presented in reference [1]. 
The resonant condition was given in the form of both an 
integral equation and a transcendental characteristic 
equation. By applymg the TR method m a well-known 
manner, the loaded cavity was presented as a cascade 
connection of the equivalent transmission lines [2] (Fig.2) 
and the characteristic equation was obtained in the form: 
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Im{Z“in} + Im{Z“in} = [ka (ko). ka (ko. )1=0 (1) 


a 
a 


Figure 2. Equivalent transmission line model of the cavity 
in Fig. 1 


b * : : 
where: Z*jn and Z = are input impedances below and 
above the reference plane p-p’, respectively; ky (ko, x) and 
k,(ko,x) are phase coefficients along the z axis in the 


lossless dielectric sample and air, respectively, satisfying 
the following relationships: 


k2 (ko.x)=ké —k2 (ko, x)—(nn/b)* (2) 
k2(ko.x)=e,(x)k¢ -—k3 (ko,x)—-(nn/b)?. (3) 


For known dependence ¢,. (x), the phase coefficient along 
the x axis k,(kg,x) (ko is the free space wave number) 


across a cross-section could be obtained from Eq.1. The 
resonance frequencies of the cavity, for a specified mode of 
oscillation, can be calculated from the condition that 
k, (ko, x) should satisfy the integral equation written in 


the form: 
a 
[kx (ko. x)de = mn m=O0120. (4) 
0 


In the case of homogeneous dielectric sample, k, is not a 
function of x, the resonant condition from Eq.4 is simple 
k,a=mm, and resonant frequencies of the cavity can be 
obtained only from Eq.1. However, for homogeneous 
dielectric sample raised above the cavity floor and/or with 
losses, it was shown that the resonant frequency curves, 
obtained from Eq.1, versus filling factor t/h are with breaks 
and depend on the reference plane location [2]. These 
problems were overcome by using the complex frequency 
o =o , +Jjo; [3,4] wherein the real part , denotes the 
circular resonant frequency and the imaginary part o, 
denotes the damping factor of the cavity and defining the 
characteristic equation in the form: 


b 
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It is obvious that the complex frequency introduction (i.e. 
ky) in the Eq.4, for the case of inhomogeneous dielectric 


sample raised from the bottom of the cavity, makes 
numerical solution of this equation very difficult. 


III. PROBLEM MODELLING 


In TLM method, an EM field distribution in three 
dimensions for a specified mode of oscillation in a 
microwave rectangular cavity is modelled by filling the 
field space with a network of transmission lines and 
exciting a particular field component in the mesh. EM 
properties of a medium in the cavity are modelled by using 
a network of interconnected nodes, a typical structure 
being the symmetrical condensed node (SCN), which is 
shown in Fig.3. To operate at a higher time-step, a hybrid 
symmetrical condensed node (HSCN) [5] is used. An 
efficient computational algorithm of scattering properties, 
based on enforcing continuity of the electric and magnetic 
fields and conservation of charge and magnetic flux [7] is 
implemented to speed up simulation process. For accurate 
modelling of this problem, a finer mesh within the 
inhomogeneous dielectric sample and cells with arbitrary 
aspect ratio suitable for modeling of particular geometrical 
features, are applied. 


Vynz 


Figure 3. Symmetrical condensed node 


In many industrial processes, the water is often used as a 
cavity load. Under temperature influence its dielectric 
properties vary causing that water is inhomogeneous 
dielectric sample. The dielectric properties of water may be 
expressed, by using the dispersion formula [8], as: 


aes 
pitas wie 0 ee (6) 


é” sida cu 7 (7) 
1+} 
r 
where: €9 and ¢,, are the static and infinite frequency 


dielectric constants [8], respectively; X is the relaxation 
wavelength of the incident radiation. The parameter } , 
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was obtained from experimental data as functions of 
temperature and for liquid water can be expressed as: 
a 


8 
T +273 oe 


Rh, =3.3836*107* exp f 
where: T is temperature in Celsius. Equations (6)+(8) 
describe fairly dielectric behaviour of the water in the 
temperature range (-20+50) °C and in the frequency range 
from DC to 10 GHz. 


TV. NUMERICAL ANALYSIS 


The numerical results, which illustrate the effect of the 
dielectric inhomogeneity on the resonant frequencies, are 
presented for a cavity with rectangular cross-section (Fig. 1) 
and selected locations of inhomogeneous dielectric sample. 
Dimension of the investigated cavity are chosen starting 
from the example used in [1]: a=35 cm, 5=37 cm and 
h=26.9 cm. How the microwave applicator is often used for 
drying of wet material, which, as a dominant element 
within itself have water, it is assumed that the permittivity 
of hypothetical inhomogeneous dielectric sample is equal 
to that of water. For simplicity's sake, only relative 
permittivity is taken into account (lossless sample) and its 
temperature dependence, obtained from Eq.6, is shown in 
Fig.4 for different frequencies. For linear temperature 
variation in the x direction in the range (5+50) °C and 
frequency /=1 GHz, relative permittivity of water 
decreases approximate linearly (from 85.26 to 69.99 along 
the x axis). 


Relative permitti vity of water 


20-10 ny) 10 20 30 40 50 
Temperature (°C) i 


Figure 4. Temperature dependence of relative permittivity 
of water for different values of frequency 


Inhomogeneous dielectric sample of constant thickness 1 is 
modelled in TLM mesh by dividing it into regions of equal 
length (4;_) =£; ) in the x direction (Fig.5). Such of these 
regions is considered as a homogeneous dielectric. 
Resolution of the applied TLM mesh for each region of 


water sample (number of nodes m,;) varies in the x 


direction (ratio Ax? / Ax?, <1) and it depends on its 
relative permittivity. Also, resolution of the applied TLM 
mesh in the z direction is different in water and air layer 


(Azd / Az3 <1) and depends on the relative permittivity of 
water. The mesh has been excited with impulse field 
excitation H, to enhance the TE,,9, mode, of which 
TE}9, is dominant mode. 
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v | F ae = 
Ls WSS SESS VY MECC 
Ws Sas 
<< WCC CG 
WANES SSeS 
“a ate \ SO Ss d 
=~ oO Ard 
aa AGRA 
A] re 
| — Ard, Ax? 
=, | 
r | | 3 
ams 
: } 
——— eal a 


Figure 5. A non-uniform TLM mesh in xz plane 


Numerical results of resonant frequencies for TE} 9; 


dominant mode versus filling factor t/h, based on the 
integral TR method, are shown in Fig.6 for mhomogeneous 
dielectric sample with linear temperature variation 
(5+50) °C (solid line), and two hypothetical cases of 
homogeneous sample with extreme temperatures: 7=5 °C 
and 7=50 °C (dashed lines). In the same figure, the star 
symbols indicate the results, obtained by using TLM 
method. As it can be seen from Fig.6, there is an excellent 
agreement between TLM and integral TR method results. 
Another interesting result is that the resonant frequencies 
for an inhomogeneous sample with a temperature gradient 
from 5 °C to 50 °C are within the limits determined by the 
two homogeneous samples. 


The results shown in Fig.7 are concerned to the case of 
symmetrically placed inhomogeneous dielectric sample 
(7s) as a special form of sample raised above the cavity 
floor. Again, it can be noticed good agreement between 
TLM and TR method results. Also, it should be pointed out 
that, in this case, the resonant frequency calculation, based 
on Egs.1 and 4, is only possible by applying the magnetic 
wall in the middle of the cavity (plane A/2). 


According to previously showed results in Figs.6 and 7, a 
general conclusion can be derived that TLM approach 
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gives valid results. On the other hand, application of TLM 
method is not limited by dielectric sample location. 
Because of that, a successful TLM modelling of microwave 
applicator, when load is raised above the cavity floor, is 
quite expectable. Regarding to this, an extensive numerical 
analysis of the behavior of resonant frequencies with 
different raising values is done. In order to illustrate the 
results, resonant frequency curve of TE, 9; mode versus 


filling factor, obtained by using TLM method for the case 
when inhomogeneous lossless dielectric sample is located 
at the height 7-4 cm from the bottom of the cavity is 
shown in Fig.8. It can be noticed the expected behavior of 
resonant frequencies. Namely, the TLM results are placed 
between the curves for two extreme cases: J=5 °C and 
T=50 °C 


0.8 


Resonant frequency (GHz) 


0.00 0.05 0.10 0.15 0.20 


Filling factor t/h 


Figure 6. The resonant frequency versus filling factor t/h 
for r=0, calculated by using: a) 3-D TLM method (*) and 
b) integral TR method (solid line) 
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Figure 7. The resonant frequency versus filling factor t/h 
for r=s (symmetrical case), calculated by using: a) 3-D 
TLM method (*) and b) integral TR method (solid line) 


V. CONCLUSIONS 


The 3-D TLM method for modelling of microwave cavity 
loaded by inhomogeneous dielectric sample is presented in 


this paper. This numerical technique has _ been 
implemented in the appropriate software and applied to the. 
problem of determining resonant frequencies. The obtained 
numerical results show that these resonant structures can 
be successfully modelled by TLM method, independently of 
dielectric sample location in the cavity. Having in mind its 
properties, losses of the inhomogeneous dielectric sample 
will be included in the future work. Also, it should be 
pointed out that the higher-order resonance of the cavity as 
well as complete six-component field descriptions of 
specified modes could be also calculated with satisfied 
accuracy. 
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Figure 8. The resonant frequency versus filling factor t/h 
for 7=4 cm, calculated by using 3-D TLM method (*) 
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Thermionic Emission Process in Carrier 
Transport in pn Homojunctions* 


Petar Biljanovi¢, Member, IEEE, and Tomislav Suligoj, Student Member, IEEE 


Abstract—Thermionic emission is treated in pn homojunctions. 
Thermionic current over pn junction barrier is derived 
directly from Maxwell-Boltzmann distribution function as a 
difference between the majority carrier flow and the flow of 
excess carriers in the opposite direction. It is shown that the 
excess carrier concentration is determined as an equilibrium 
between thermionic and diffusion process, which depends on 
quasi-neutral region width. Total current is limited by 
thermionic process for very thin quasi-neutral regions. Such 
an approach enables the determination of boundary condition 
for any doping profile. 


NOMENCLATURE 


D, Electron diffusion constant 
EFn Electron quasi-Fermi energy 
Ex, Hole quasi-Fermi energy 
Eg Energy gap 


E, x-directed carrier energy 

h Planck’s constant 

Six diffusion electron current density 

Jthn+  X-directed thermionic electron current density 
Jie -x-directed thermionic electron current density 
k Boltzmann constant 

lis Electron diffusion length 

m, Electron effective mass 

Non Equilibrium electron concentration on n side 
Nop Equilibrium electron concentration on p side 
Ngo Excess electron concentration on p side at space 


charge region edge, for uniform doping profile 
n,(0) Excess electron concentration on p side at space 
charge region edge, for non-uniform doping profile 


Nn; Intrinsic carrier concentration 

n, Concentration of electrons with x-directed 
velocities 

Pop Equilibrium hole concentration on p side 

q Elementary charge 

T Temperature in Kelvins 


U Applied voltage 

Ug Built-in potential 

Ur Thermal voltage 

Vihx x-directed thermionic velocity 

Va effective diffusion velocity 

Wp width of quasi-neutral region on p side 


The authors are with the University of Zagreb, Faculty of Electrical 
Engineering and Computing, Unska 3, HR-10000 Zagreb, Croatia. 


I. INTRODUCTION 


Conventionally, pn junction current-voltage characteristic is 
described by drift-diffusion model proposed by Schockley 
[1]. It is derived from diffusion equations for minority 
carriers on each side of the junction. Although it is proved 
by many measurements that it describes pn junction 
characteristic very accurately (e.g. [2]), it does not give a 
clear picture of transport process in pn junctions, i.e. it does 
not describe how the majority carriers come to the other 
side of the junction, where they become minority carriers. 
Boundary conditions are set by solving Poisson’s equations, 
then obtaining built-in potential and replacing it by total 
bias, including applied voltage. 


Since the carriers are injected across the pn energy barrier 
(space charge region), this process is thermionic emission. 
In 1949 it was proposed by Moll [3] that pn junction 
transport is two step process: carriers are first injected 
across the space charge region by thermionic emission, 
which is followed by diffusion process through quasi- 
neutral regions. Those two processes are in series and the 
slower one, that is usually diffusion, limits current through 
the junction. For pn homojunctions with moderately wide 
quasi-neutral regions (w>>ZL) the effective diffusion 
velocity vz =D,,/n(dn/dx), is much less than x-directed 


thermal velocity vy, =4/(KT)/(2mm) , and carrier transport 
is determined by diffusion process. With shrinking the 
quasi-neutral region width, the effective diffusion velocity 
is increased due to the increased concentration gradient, and 
when it becomes comparable to the thermionic velocity 
(va = Vy,x), the thermionic process becomes the limitation of 
a current. This is the case in thin bases (100 nm) of bipolar 
transistors. This phenomena was observed since measured 
cut off frequencies for thin bases bipolar transistors were 
lower than the ones predicted by classical diffusion theory. 
It was first noted by introducing thermionic saturation of 
current empirically [4], and latter on by direct integration of 
Boltzmann transport equation [5]. Recently, carrier 
transport was examined by McKelvey’s flux method [6]. 


* This work is supported by the Ministry of Science and Technology of the 
Republic of Croatia under 036001. 
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Fig.1. Energy band diagram of pn homojunction in equilibrium. 


Another way of how thermionic saturation of pn junction 
current can be observed is when thermionic emission 
process is reduced for given diffusion current [7]. This is 
the case in heterojunctions where thermionic current is 
decreased due to the spikes in energy diagram [8]. 


In a fact, heterojunctions initiated the introduction of 
thermionic emission and although the similar approach is 
applied to homojunctions [9](primarily to thin base bipolar 
transistors), as mentioned above, it seems that thermionic 
emission process is generally overlooked in pn 
homojunctions. 


In this paper we demonstrate a physical approach of 
deriving thermionic current directly from Maxwell 
Boltzmann statistics, and treat pn carrier transport in details 
for both reverse and forward biased junctions, for thin and 
thick neutral regions and for non-equilibrium carrier 
distribution. Furthermore, since all of the considerations 
made before dealt with uniform and abrupt pr junctions, we 
propose a general approach for any doping profile. 


II. THERMIONIC EMISSION CURRENT 


The pn junction energy-band diagram with uniform p and n 
sides in the equilibrium state (U = 0) is shown in Fig. 1. For 
an abrupt junction, a built-in potential is given by: 


ge: (1) 


A nondegenerate semiconductor is analyzed, what means 
that Fermi level energy lies within the band gap at a level 


which is more than 3kT below the top or above the bottom 
of the gap in an n or a p type semiconductor, respectively. 
Hence, the x-directed Fermi-Dirac distribution function can 
be approximated by x-directed Maxwell-Boltzmann 
distribution function, which is, for electrons in the 
conduction band: 

Ps 


as m, Ern aa EY 
dn, = vibarolsh sil ae Wah (2) 
where: 
1 
E, = Eg t+ >MVing » (3) 


is total, x-directed electron energy. The distribution function 
given by (2) is depicted in Fig. 1. for both majority and 
minority carriers on both sides of the junction. The analysis 
will be done for the electrons, but analogue considerations 
are applicable for holes. 


Generally, the thermionic current of electrons is given by: 


Jinn = [rmedn, : (4) 
(vx) 

Integral term in (4) can be written in a term of x-directed 
energy by using equation (3). The electrons that can be 
emitted over the barrier to the p side of the junction must 
have -x-directed energies larger than E, = Eg +qU, . The 
number of such electrons is proportional to the area A in 
Fig. 1. Therefore, this value of £, should be used for the 
lower bound of integral and thermionic current in +x 
direction can be written as: 


(Eps E 
Jin ntl 4nq me kT fexp “ - dE, : (5) 
i i e 
EG+quyx 


By using relation: 


3 
fi: =o melt T)? exp= Enea (6) 


equation (5) yields: 


kT —qux 
Sittin = Non &X : 7 
th, 1 Sm, on XP (7) 


According to (1), the electron concentration on the p side of 
the junction is: 


2 
Wee e date nistex = EL 8 
Op Poy on XP kT ( ) 
and this term represents the effective concentration of 
electrons, which move across the space charge region 
(SCR) in -x direction. The root term in (7) represents the 
mean thermionic velocity in +x direction, also called 
Richardson velocity [6]: 


Vin, x = > (9) 
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Fig. 2. Energy band diagram of forward biased pn homojunction, 
and corresponding minority carriers concentration. Since 
Maxwell-Boltzmann distribution is not valid in non-equilibnum, 
distribution functions for minority carners are not depicted. 


and is equal to one fourth of mean thermal velocity 
Vinx =n /4 : 


Considering the electrons on the p side of the junction, it 
can be shown that the electron current from the p to the n 
side (-x direction) is given by: 

m, t Ex, =e 

3 joxp—_— dE, (10) 


( 


J thn = 4ngkT 


what is proportional to the area B in Fig. 1. After solving 
integral it is shown that (10) is equal to (7) what means that 
J thn=Jt,n+ and area A is equal to area B in Fig. 1. Since pn 
junction is in equilibrium and there is no excess carriers, 
electron diffusion current is Jz,=0, and the total net current 
through the junction is equal to zero. 


A. Forward biased pn junction 


Fig. 2. depicts the pn junction in non-equilibrium, in 
forward biased region. The energy barrier for electrons on 
the n side is reduced from qUx to q (Ux — U) < qUx. If qUx 
is replaced by q (Ux—U), equation (7) can be written as: 


-qUx -U) , 


J thins = VW th.x"on exp kT 


(1) 


Since the x-directed mean electron velocity is derived by 
using Maxwell-Boltzmann exponential distribution function 
as weighting factor, equation (9) does not depend on the 
energy barrier height and is equal for any applied voltage. 
Therefore, carriers always move across SCR at velocity 
given by (9), and the current varies by varying the effective 
concentration of carriers. 


Electrons injected over the barrier increase the 
concentration of electrons on the p side, near the space 
charge region, (depicted in Fig. 2. as myo). The excess 
electrons have arrived from n side and they all have +x 
directed velocity. Maxwell-Boltzmann distribution function 
is not valid since electrons and holes are not in thermal 
equilibrium. Several collisions must take place to make 
velocity distribution become isotropic. The typical value of 
minority carrier free path is of the order of 50 nm, what is 
far from SCR. Anyway, the electron current in -x direction 
is determined by the concentration at SCR edge, n(0) = no 
[6], and can be written as: 


J than- fo YW thx" po , (12) 


what is the key assumption of current balancing concept 
[9]. Due to many collisions and increased concentration at 
SCR edge, the transport of excess electrons through quasi- 
neutral p region is characterized as diffusion process 
described by diffusion equation. The net current across SCR 
must be equal to diffusion current: 
dn 
J thins =e =J an = qD,, he : (13) 
Generally, distribution function for uniform quasi-neutral 
region is obtained by solving diffusion equation, and yields: 
n Ff | Woe 
n,(X)=Nop Pfeil Ure as (14) 
sh(w , /L,,) L 
Substituting (11), (12), and (14) into (13) and using (1), the 
electron concentration at x = 0 is obtained: 


x=0 


n 
fllt= orl “A isaah - ge. 08,. pita oy ae _ (15) 
ie D w 
Ne n cth Pp Vinx n Ln 
v 


th.x*“n n 


It is worth noting that if the width of quasi-neutral p side is 
much greater than electron diffusion length (w, >> L,), the 
electron concentration at x = 0 is: 


n 
Ny =— Wa ee 2 D, } (16) 
pene Vinx’ n 


VinxLn 


Ratio D,/L,, determines the effective diffusion velocity (for 
w, >> L,), which is given by vz = D,/n (dn/dx). Hence, the 
term D,/(vy,xL,) is a ratio between diffusion and thermionic 
velocity v/Vu,x. Assuming D, = 50 cm?/s and L, = 10 pm, 
va = 5-10° cm/s, while vy,x calculated by (9) is viz = 10’ 
cnv/s. It can be concluded that the ratio vy» x 1S normally 
much smaller than one for wide quasi-neutral regions. 
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Equation (16) than yields standard, Boltzmann boundary 
condition, nyo = noe”. 


If the width of quasi-neutral p side is much smaller than 
electron diffusion length (w, << L,), diffusion length ZL, in 
(16) must be replaced by the width of quasi-neutral region 
w,. If w, is so much decreased that the effective diffusion 
velocity (vz = D,/w,, for w, << L,) becomes much larger 
than thermionic velocity vy, and the term D,/(vz,.w,) 
becomes much larger than one, (16) can be written as: 


Vp yW D 
th, 
= Nop ———| er + —*_ |. (17) 
n Vinx” p 


For higher voltages, if U is greater than several U7, and for 
moderately wide p side w,, the second term in bracket in 
(17) can be neglected. On the other hand, if w, > 0, 
Npo —> Nop, and the pn carrier transport is limited by 
thermionic emission, it is purely ballistic. The typical quasi- 
neutral region widths when thermionic emission starts to 
limit the transport (when vj/vy,, becomes comparable to 
one) are 100 nm, which can be a base width of modem 
bipolar transistors. The thermionic emission is the reason 
why the measured values of cut-off frequency are in 
discrepancy with theoretically predicted ones. 


If (15) and (14) are put into (13) a pn junction current can 
be obtained. Since it was shown that carrier transport is 
purely diffusive for w, >> L,, the total pn junction current 
is equal to expression derived by drift-diffusion concept. In 
the case of w, << L, the electron component of pn junction 
current is given by: 


D 
Ji= Qioy —2 1 _(.Uits ag (18) 
dg) (oe et 
Vinx p 


If the ratio of diffusion and thermionic velocity D,/(Vu,.,) 
in the denominator of (18) is much lower than one, equation 
(18) becomes standard diffusion current equation, while for 
w, — 0, (18) becomes: 


“8 = tro Ving CU? 4); (19) 


and this represents the upper limit of pn junction current, set 
by thermionic emission process. 


B. Discusson 


For moderately wide quasi-neutral regions, thermionic 
emission is much faster process than diffusion (Vy, >> va), 
and thermionic currents, either Jy,,+ or Jy,,.,are much 
greater than diffusion current Jz,, which is determined by 
their difference, according to (13). Therefore, it can be 
concluded that Jy,,+ =Jinn, i.e. the majority of electrons 
injected from n side are reflected backward. A small 
fraction of them enters into quasi-neutral p side and cause 
diffusion process. Such an approximation enables the 
determination of excess electron concentration at x = 0, for 
any doping profile: 


l oO 
ny Oe Jrmedn eC =%,). (20) 
Vih.x qUx-U) 


Distribution dz, should be obtained from (2), by the 
determination of quasi Fermi level Ep, at x =x,. Built-in 
potential Ux and distance x, should be determined by 
solving the Poisson’s equation. 


III. CONCLUSIONS AND FURTHER INVESTIGATION TIPS 


It is shown that the excess electron concentration on the p 
side is an equilibrium between thermionic and diffusion 
processes. Therefore, the value of nyo depends on w,. If the 
limitation of carrier transport is diffusion process, which is 
the case for w,>>L,, np is determined by standard 
Boltzmann boundary condition. If the transport is limited by 
thermionic emission process, which is the case for w, << L, 
and D,/(Vi,x.W,) comparable to one, 70 tends to equilibrium 
concentration Mo,, as w, decreases. 


Regardless of non-Maxwellian distribution at x = 0, and the 
assumption that excess carriers (71,0) flow from p to n side, 
the carrier transport in pn homojunctions is described by 
(13). On the other hand, it can be assumed that the electrons 
of equilibrium concentration (nop) flow toward n side. Since 
the diffusion and thermionic processes are in series, the. 
total pn junction current can be written as: 


] 1 ] 


= ata ‘ 
J n J an J th, n+ 


(21) 


Such an approach yields total current the same as the one 
given by (18), but it assumes Boltzmann boundary 
conditions, i.e. (70) does not depend on quasi-neutral 
region width. 
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Abstract— This paper presents a new procedure for noise 
modeling of microwave transistors based on the wave 
approach. The correlation matrix elements are calculated in 
terms of three equivalent noise temperatures. The strength of 
noise wave sources and correlation coefficient are shown 
graphically as a function of frequency and discussed. Also, 
the transistor noise parameters are determined from 
calculated noise wave sources by using microwave circuit 
simulator Libra. 


Index terms—noise waves, modeling, microwave transistors 


I. INTRODUCTION 


Small-signal low noise microwave transistors are usually 
represented by their equivalent circuits that include noise 
sources. There are several different representations of a 
noisy two-port circuit, [1]. Each representation is 
characterised by the existence of a noiseless two-port 
network and two additional correlated noise sources. Noise 
is typically characterised using combinations of equivalent 
voltage and current sources. 


A significant contribution to the MESFET noise modeling 
was made by Pospieszalski, [2]. The noise model he 
presented is based on H representation of MESFET 
intrinsic circuit with two uncorrelated noise sources. For 
noise parameter calculation, it is necessary to know 
equivalent circuit elements and two frequency independent 
temperatures called equivalent gate and drain temperature. 
In previous work we extended the Pospieszalski's noise 
model by introducing the correlation between noise sources 
using third complex equivalent temperature, [3]. 


For high frequency circuit applications, however, a wave 
interpretation of noise seems more appropriate. Wedge and 
Rutlege [4] showed that wave approach can be useful for 
noise modeling and measurement of microwave FETs. 


Noisy two-port network can be described by using either 
waves that emanate from its ports or waves that emanate 
only from the input port. Combining the first wave 
approach with Pospieszalski's concept, the noise wave 
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parameters were derived in terms of intrinsic circuit S 
parameters as well as equivalent gate and drain 
temperatures, [4]. 


In this paper we have derived these correlation matrix 
elements in terms of three equivalent temperatures: gate, 
drain and correlation temperature. Comparing in this way 
obtained results with the measured ones, the excellent 
agreement is observed. Also, the standard noise parameters 


(minimum noise figure, F,,,,, noise resistance, R,, and 


optimum source reflection coefficient, I,,,) are expressed 
as the function of scattering parameters and correlation 
matrix elements calculated in the described way. It is 
shown that noise parameters modeled in this way has the 
good agreement with measured ones. 


The complete procedure is CAD-oriented and can be 
implemented in the commercial high-frequency circuit 
simulators, like Libra. 


II. NOISE REPRESENTATIONS OF TWO-PORT CIRCUITS 


Consider the noise equivalent H representation of Figure 1. 
In this case the linear matrix equation describing the noisy 


two-port is 
V, H H Jf 
In} (Ho, H2\|V2) Litas 
The other representation based on noise waves and 


scattering parameters is shown in Figure 2. Noise waves 
6, and b,, contribute to the scattered waves such that the 


wave variables and scattering parameters satisfy the 
following equation 


b Si, S a b 
‘=| i al I+] a 0) 
bo} [S21 S22 JL@2} Lone 

The elements of noise source vector generally are 


correlated and characterised by correlation matrix Cy, given 
by 


Co =| Emi) mbna? 


j (3) 
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where brackets {) indicate time average of the quantity 
inside and * indicates complex conjugation. 


noiseless 
two-port 


[1] 


Figure 1. H representation of a noisy two-port 


oa | 
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two-port 


[S] 


— b, 


Figure 2. S representation of a noisy two-port 


Noise waves a; and b; at the i-th port (i=1, 2) are defined 
in terms of port terminal voltage V;, current /; and 
characteristic impedance Z; as: 


Vosidoh Vj-Z'1, 


By substituting the last relations in (2), (assuming real 
characteristic impedance, Z;=Zy) the. formulas for 


voltage and current noise sources e,, and iz, in terms of 


gs 
noise wave sources b,, and b,» can be derived: 
2 bn ¥Zo 
ey, =————__,, 4 
ine = 2 Sy18n1 wb (5) 
¥Zo (1+ S22)(1- S11) 2 (1+ S22) 
and 
* 2 * 
: 4$91 (bm ) 4(bnb 
Costas oe 1 | nil a ( nl 02) (6) 


Kent a Oeeay (1-S,,)+S}2) _ 


III. MESFET NoIsE MoDEL 


A typical equivalent circuit of MESFET and HEMT chip 
including noise contribution of intrinsic circuit is 
considered, as it is shown in Figure 3, where 


Ym = & me ’°*. The intrinsic part of the circuit is denoted 
by the broken line. 


The noise properties of an intrinsic chip are treated by 
assigning equivalent temperature T, and Tz to the 


resistances R,, and R,,, respectively, so that 


2 
(legs| ) = AKT R, AF , (7) 
(liasl’) = a4r aus (8) 


where k is Boltzmann constant and Af is noise bandwidth 
(it is assumed Af =1Hz). 


With the aim to express the correlation between the noise 
sources, we introduced the third equivalent temperature- 
correlation temperature, which has a complex value, 


= IT, e°" , in the following way: 


(egsigs) =A4kT Af . (9) 


Figure 3. Equivalent circuit of MESFET and HEMT chip 


By using this concept of noise temperatures and equations 
(4)-(6), noise wave parameters for the intrinsic circuit are 
obtained in the following form: 


(bm?) = KT, (1=[Su). (10) 
(nal) = de (i sp) gah + 
l1—S),| (1) 
+2Re{kT.S5i(1+S55)}, 
Ss : 
(mbna) =< 4 (bral) -#TeA- Sis )(1+ S22). (12) 


1s 


The expressions for the noise wave parameters in terms of 
the intrinsic equivalent circuit elements of Figure 3 are: 


AKT pRysZ9® Cas 


2 g**gs 
(onl = (13) 
¢ 1+07C2,(Rys + Zo)” 
(b Ale ze AkT Ro sZoSmRas 
n 
(Zo + Ras) [1+07C35(Res +Zy)° ivy 
4kT yRasZo =8R kT Ym Ra.Zo (14) 
2 . a) 
(Ras +Zo) 1+ joCgs(Rgs +Z0)J (Ras +Zo) 
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IV. NUMERICAL EXAMPLE 


The proposed procedure for the extraction of noise wave 
sources is applied to various MESFET and HEMT 
transistors. The results presented in this paper are related 
to NEC MESFET N71000A. For this transistor there are 
manufacturer’s S-parameter data in the frequency range 
(1.5-26.5) GHz and noise parameter data at several discrete 
frequencies in the range (2-18) GHz. 


The values of the MESFET equivalent circuit elements are 
extracted from the S-parameter data by using standard 
circuit simulator Libra. The element values obtained for 
model shown in Figure 3 are: R,=0.2772, L,=0.16nH, 


R#2.023Q, L=0.065nH, R,=4.3880, L,=0.028nH, 
n=48.57MS, 1=3.345ps, C,-0.052pF, R,,=3.10, 


C,.=0.282pF, Ry=245Q, Cy,=0.088pF. 


Besides, it is necessary to express three equivalent noise 
temperatures Tn T, and 7, versus frequency, [3]. 
Calculated parameters are inserted into the equations (13)— 


(15). 


The noise wave parameters for N71000A transistor are 
shown in Figure 4. With the aim to verify the presented 
procedure, the relations which express correlation matrix 
elements in terms of intrinsic circuit noise parameters and 
S-parameters, [4], are applied, 


s 

(bmi?) = Ko Fri ~ 1 [Sis]? -1) + te Sie 
Zo lt +Topy 

2 2 4R heel 
bral”) =|Saal" To} Fin —1+2——+ |, (17) 

20 | +Topu| 
Onbag) = on  ).  8) 

lt + Tope Zo Su 


These intrinsic circuit parameters are obtained from S and 
noise parameter measured data for the whole transistor in 
chip or packaged form. For this purpose, a de-embedding 
process is done by Libra, by adding parasitic elements with 


negative values and in reverse order to the device two-port 
circuit. 
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The curves obtained by using equations (16)—(18) are also 
shown in Figure 4 and denoted by Ref. [4]. Good 
agreement between these two curves for each correlation 
matrix element is observed. However, there are not a 
complete agreement between them because the expressions 
(10)-(15) were derived assuming that S,,=0, while in the 


de-embedding procedure S, #0. 


<|b, ,|2> * 1022 
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Figure 4 a) The strength of two noise wave sources b) The 
magnitude and angle of correlation term 


V. TRANSISTOR NOISE MODELING USING NOISE WAVE 
TEMPERATURES 


Standard noise parameters can be expressed in terms of 
correlation parameters and S parameters, [4]. A procedure 
for such noise modeling of microwave transistors is 
implemented in Libra. 


The set of these equations is programmed in Libra using 
"equation" capability of circuit simulator. The correlation 
matrix elements and S parameters are calculated for 
equivalent circuit parameter values extracted in the manner 
described in previous section. In this case three equivalent 
noise temperatures are obtained from noise parameter data 
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by using the procedure proposed in [3] which includes an 
optimisation routine. Obtained noise temperatures are: 
T,=153.96K, T=2018.7K, | 7,|=37.8K, t,=11.3ps. 


Four noise parameters are calculated for whole transistor 
circuit including parasitic elements. The noise parameter 
characteristics are shown in Figure 5. The curves obtained 
by using the proposed model are denoted by MOD. The 
referent curves based on the data measured by 
manufacturer are denoted by MEAS. 


The comparison between curves in Figure 5. shows a good 
agreement between measured and modeled noise 
parameters. Several other transistors have been analysed 
and modeled with similar results. 


VI. CONCLUSION 


In this paper we derived the complete expressions for noise 
wave sources in terms of three equivalent temperatures: 
gate, drain and correlation temperature. These equations 
include correlation between noise wave sources as complex 
value with real and imaginary part, which was not 
encountered in the literature before. These relationships are 
verified by using expressions from literature which connect 
the correlation matrix elements with four noise parameters 
for noise wave representation. The noise wave correlation 
matrix elements are calculated for MESFET N71000A and 
shown graphically as function of frequency. A set of 
equation describing the-noise parameters for noise wave 
representation as the function of noise wave sources and S 
parameters is implemented within the. circuit simulator 
Libra. Noise parameter characteristics obtained by 
proposed modeling procedure are compared to measured 
ones and quite good agreement is observed. 
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Figure 5. Noise parameters of NEC MESFET N71000A: 
a) optimum reflection coefficient b) minimum noise figure 
c) normalised noise resistance 
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Abstract-- A novel, sophisticated and_ versatile 
Periodically Poled Lithium Niobate (PPLN) 
waveguiding structure is designed to operate as an all 
optical-transistor, at environmental temperature and in 
the second band of fiber optic communication. The 
interaction among guided and leaky modes is obtained 
by means of two different techniques: i) birefringence 
mode-matching and ii) quasi-phase-matching by 
periodically poling the lithium ferroelectric domains. 
The correct operation of device at the wavelength 1320 
nm requires a strong diffusion of Mg in the region 
external to the periodically poled waveguiding channel, 
even in order to maximize the bounding of the e.m. 
field and the overlapping integrals. 


Index terms — optical amplifiers, lithium niobate, optical 
communication. 


I. INTRODUCTION 


Several millions of kilometres of optical fibers have been 
installed in the last years because they constitute an ideal 
communication medium. Thus one of the most important 
goals, for both applied research and industry, consists in the 
design, engineering and construction of high-performance 
optical devices as efficient laser sources, optical amplifiers 
and all-optical devices advantageously operating on the 
commercial optical fiber window. 

The phase shift in noncentrosimmetric crystal, by 


(2) 


using cascaded second order nonlinearity 42) SHY, aaneliCcts 


has been largely demonstrated and experimented in the 
recent years to construct optical switches [1-2] or 
amplifiers. A proposed amplifier [3], was based on Z-cut, 
Ti diffused lithium niobate waveguide and utilized both the 
up-conversions Type I and Type II interactions. This 
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device, in order to operate in the third band of optical fiber 
i.e. at 1550 nm_ wavelength, required a phase-matching 
temperature control (close to 401.1 °C) to enhance Type I 
second harmonic generation while the Type II suffered a 
mismatch depending by modal dispersion. The spreading of 
the amplifiers made of Er-doped optical fiber has addressed 
the research interest towards the investigation and design of 
devices operating at wavelengths around 1550 nm where 
unfortunately the fiber dispersion limits the maximum 
distance of transmission. Well-known solutions are 
compensation of dispersion and wavelength conversion but 
the ideal strategy consists of designing amplifiers operating 
at fiber zero-dispersion wavelength. 

In this work a novel sophisticated Periodically Poled 
Lithium Niobate (PPLN) waveguiding structure is designed 
to operate as an all optical-transistor, at environmental 
temperature and in the second window of fiber optic 
communication, i.e. at the wavelength 1320 nm . This goal 
is attained by constructing a grating made by periodically 
inverted ferroelectric domains on a lithium niobate crystal 
having the dielectric tensor in equatorial configuration. The 
birefringence, i.e. the orientation of the optical c-axis in the 
equatorial plane xz with respect the propagation direction, 
is utilized to obtain the Type I mode-matching, while the 
periodic poling of ferroelectric domains is utilized for the 
Quasi Phase Matching (QPM) of the Type II interaction. In 
particular, the interaction in the proposed structure occurs 
among hybrid modes having both TE and TM components, 
this nature originating not only from the rectangular section 
of waveguide but, first of all, from the arbitrary orientation 
of c-axis in the xz plane). The hybrid guided (G) and 
leaky (L) modes (propagating below cut-off) [1-2] are 
involved in both the Type I 


Goo (Mp) + Gho (Mp) > Loo (@sx ) and the skype It 


Gio (Mp) + G2, (Op) > LO) (sx) interaction on which 


the device operation principle is based. The apices x and y 
indicate the predominant TM and TE polarization natures, 
respectively. It is worthwhile to notice that the 
simultaneous employment of birefringence and QPM 
techniques is possible only if an optimized profile of 
ferroelectric domains is designed, by calculating the most 
opportune duty-cycle. Moreover the correct operation of 
device at 1.320 um wavelength requires a strong diffusion 
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of Mg [1-2] in the region external with respect to the 
periodically poled waveguiding channel, in order to obtain 
the mode matching and maximize the bounding of the e.m. 
fields. 


Il. THEORY 


The second order nonlinearity properties of LiNbO3 


crystals having the optical c-axis in the equatorial dielectric 
tensor configuration as shown in Fig. 1, are expressed by 
the equation: 


Pj (xX y,Z)= dijk (y,X, Y,Z) Ej (x, y,Z) Ex (x, y,Z) 
Where P is the second order polarization vector and E is 
the electric field expressed in the xyz frame. In particular: 


di5(y) = d3\sin(y) + do cos(y) 

dig (¥) = d3) cos(y) — dy7sin(y) 

d51(¥) = 416 (Y)) 

dyo(Y) = d5ysin3(y) + 3d3) cos(y)sin* (Y)+d33 cos*(y) 
do3(Y) =doo cosy)? sin(y) +33 cos(y)sin? (y) + 


+ d3, cos(y)[cos(y)? — 2sin? (y)] (1) 


d54(Y) =—do cos(y)sin(y)* +d33 cos(y)7 sin(y) + 
+d3sin(y)[cos (y)? ~2cos*(y)] 
d31(y) =dy5(¥)s d30(¥) =do4 (9) 


d33(Y) =—d cos*(y) + 3d3, cos(y)7sin(y) + d33sin°(y) 
d34(¥) =do3(Y) 


where, using the contracted notation, djj are the elements of 
the second order non linear tensor in the principal crystal 
AXESE N= Y= eS while dij) are the elements of the 
second order non linear tensor in the rotated frame: the 
elements not reported are zero. 

The inversion of ferroelectric domains obtained via a 
poling, x-directed electric field, produces a grating with a 
rectangular profile. Because hybrid modes with both TE 
and TM components propagate (this being due to the 
arbitrary oriented optical c-axis) both d33 and d3, elements 
are involved in the interaction [1-2]; the alternation of 
positively and negatively polarized surfaces in both cases 
can be expressed via the Fourier series: 


, 27Nz 
i 


+00 
ay (a= ney cee (2) 
n=-—co 


where v=1,3 and A is the grating period. 

The conventional coupled mode equations, relating 
the interacting waves are derived by taking into account the 
eqs. (1) and (2). By following the conventional nonlinear 
coupled mode theory [1-2] and normalizing the 
electromagnetic field profile to a unit power flow, the 
perturbed field amplitudes 
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Fig. | Sketch of the all-optical amplifier. 
of the propagating modes (identified by the same apex and 
subscript choice made for the modes) aj (Op) : a (Wp), 


ajo (Msp), 2; (Wsy ) are linked by the following system of 


four differential equations: 


dagq(@ ) 


BE = Thabo (Ost) (a¥y(@))” € JABFSH 7K, (y,D) 
Zi 


— jad) (@sy) (ad, (@L))” lau *K3(y,D) 


da} (@L) sj 

= Tab (SH) po (@L)) e iPorm *K3(7,D) 
dajy(@ 

dad ; j 

OSD = — jag (en,) 98, ()e"PO™ K(4,D) 


The coefficients K,(y,D) are the overlapping integrals. 


Which are the sum of a number of terms. Their calculation 
is very heavy, also for the hybrid nature of the modes 
having all the polarization components. Moreover these 
terms depend by the duty-cycle D of the waveform of 
ferroelectric domains. 


The Quasi-Phase-Matching condition among 1) the 
fundamental Gj (@p) and = G%,(@p) and the ~—_ second 
harmonic L;(@g}4) modes, for n=1, and the birefringence 
mode matching between ii) the fundamental G3) (@p) and 


second harmonic L}g(@gyq) modes are imposed. The Type 


I and Type II phase mismatches _ are: 
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ABesn SP oral and 


20 
ABgem =Br x, ~ Boy “Boy a 
respectively, indicates the phase constant of the 
considered propagation mode. 

Guided and leaky modes of the diffused channel 
waveguide have been calculated via the an appositely 
modified version of SEIM (Simple Effective Index 
Method) [1-2], by taking into account the arbitrary optic c- 
axis orientation. The eigenvalue equation of each slab is 
obtained by imposing the continuity of the tangential 
components of the e.m. fields at the layer interface and 
with the help of the transformation matrix for the diffused 
region. The transformation matrix is numerically evaluated 
via the Gear-Predictor-Corrector Method (GPCM) [1-2]. 
The same GPCM procedure has been used to integrate the 
differential system of coupled equations (3). 


III. SIMULATION RESULTS 


The proposed device is constituted by a Ti:LiNbO3 
channel waveguide, having realistic optical and 
geometrical parameters. The extraordinary and ordinary 
refractive index ne and no, are semigaussian along the x- 
coordinate and gaussian along the y-coordinate: neo = 


Nes,os + Anes,os e-(x/d)2 e(y/w)? where nes=2.1452 and 
Nos=2.2197 are the extraordinary and ordinary refractive 
indices of the substrate at the second harmonic wavelength 
Ash=0.66 Um; Anes = 0.028 and Anos= 0.013 are the 
extraordinary and ordinary refractive index changes at the 
surface; dg = 1.65 pm and dy = 2.2 pm are the 
extraordinary ond ordinary in-depths lengths; the lateral 
diffusion length of the waveguiding channel is and w = 8 
um. In the region out the channel a suitable Mg diffusion is 
considered in order to obtain a decreasing of substrate 
refractive indices Anes = - 0.013 and Angg= - 0.006 [1-2] 
and the waveguiding of the desired modes. 

The behaviour of overlapping integrals as function 
of duty-cycle of the waveform describing the inversion of 
ferroelectric domains is investigated, by changing the duty- 
cycle D in the range 0 +100. As example, Table I reports 
the overlapping coefficients for three different values of D, 


having chosen ABgsy= 0.3808 m!, obtained for an 
equatorial angle y= 66.136°, and ABgpm= 0 obtained by a 
grating period A= 377. 36 um. The more advantageous 
choice of the overlapping coefficients can be therefore 
obtained by a finely tuning of D. It is worthwhile to note 


that for D=50%, type I interaction is neglected and so the 
device does not operate correctly, because the coefficient cg 


of eq. (2) is zero. 


TABLE I 


Overlapping coefficients K; (yD) [W Wim), calculated for an 


equatorial angle y= 66.136°, and a grating A = 377. 36 um, for different 
duty cycles D. 


D 60% 75% 90% | 
Ky [32.652 39.405 46.157 | 
KS 9.4956 7.0599 | 3.0853 
Ky 9.4956 7.0599 3.0853 
K4 31.121 33.715 | 36.309 
TN RKe 3.6554 2.7178 1.1877 
05 - T + 
ee | 


A ua - f 
‘ LA 


damod pezipeuLiony 


Interaction length [mm] 


Fig. 2 Evolution of the normalized powers. Overlapping coefficients 
calculated for an equatorial angle y= 66.136°, and a grating A = 377. 36 


2 
a} (or) =120W . 


2 
atop} = 


Lm, liad 2: = 


On the contrary, for D=100 % only the type I interaction 
occurs while type II is neglected. Fig. 1 reports the 
evolution of the normalized power of the modes, calculated 
as the ratio between 


2 
ado (WsH ) and the 


2 2 2 
xX 
door) 8s @e) Ja01(sn) : 


2 2 
input power P;,= agn(or)| + ee, (Of ) ; 
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L=45 mm, optical c-axis oriented at the equatorial angle y= 66.127°, 
grating penod A = 381. 946 um. 


2 


2 
it is Pj,/2 = ago (oR) = =120 W. In particular 


y 


Fig.| shows the case of larger interaction, 1.e. for D=75% 
and propagation losses equal to 0.2 dB/cm are taken into 
account. In this case the waveguide does not operate as an 
all-optical transistor but the strong interaction among the 
modes is apparent. 

The operation as all-optical transistor is 
calculated for a channel waveguide having length L=45 
mm, optical c-axis oriented at the equatorial angle y= 
66.127° with respect the propagation direction and grating 
period A= 381. 946 um. The corresponding phase 
mismatch are ABpsy = 66.6667 m! and ABopy= - 200. 


The Fig.3 illustrates the normalized power 
2 


2 2 
I(ao(@r) + \ote sthe 


Pout. = pir) 


y 
a9, (Mp) 


fundamental mode Gi (@p), calculated at the device output 


as a function of the power launched at the input section 
P2 


“is reported for three different D 


2 
Pin=8o(e) + fa, (op) 
values: D=60%, D=75% and D=90%. The best performance 
is obtained for D=75% where the curve exhibits the higher 
slope close to Pin= Phiags =145 W, the launched input 
power being the same for both the quasi-TE modes P;,/2= 


2 
ado (Wf) 


QE 
Riley. 
=la8, (op) 


. In other words, close to the Phias 


value, small variations of the input power cause heavy 
changes of the output. Fig. 4 highlights the corresponding 


behaviour of the phase of the G2) (wp) mode for the three 
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Fig. 4 Phase characteristic. of the all-optical transistor. Channel 


waveguide length L=45 mm, optical c-axis oriented at the equatorial angle 
y = 66.127°, grating period A = 381. 946 um. 


aforesaid duty-cycle values. The strong phase change of m 
for D=75% close to Phias= 145 W confirms the strength of 


mode interaction 


IV. CONCLUSION 
A LiNbO, channel waveguide designed to operate as all- 


optical transistor by utilizing both birefringence matching 
and QPM . The preliminary cases presented in this work 
promise further and strong enhancement of device 
efficiency via the suitable choice of the optical c-axis 
orientation and the grating period. 
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Wavelength Conversion in Periodically Poled 
Lithium Niobate Waveguides Exploiting 


Cascaded Second Order Nonlinearity y) ay 2) 


M. Bozzetti, A. D'Orazio, M. De Sario, V. Petruzzelli, F. Prudenzano 


Abstract-- A wavelength converter in the third window of fiber 
optic communication is designed. It is based on a Periodically 
Poled Lithium Niobate (PPLN) channel waveguide and 
cascaded second order nonlinearity effect. Both stationary and 
time domain investigations confirm that the device 
performance is improved by an integrated metal mirror, 
placed after an additional waveguide segment having suitable 
length. This is calculated in order to provide an optical path 
along which the co-propagating electromagnetic waves, 
changing their phase mismatch, can interact more efficiently 
after the mirror reflection. 


Index terms — optical communication, nonlinear optics, 
lithium niobate. 


I. INTRODUCTION 


The construction of efficient all-optical devices is the key 
element of the further development of integrated optics; 
only by achieving this goal, the optic version of the 
conventional electronic circuits will be possible. Large 
efficiency in cascaded second order and in third order 
susceptibility processes are thus important aims of 
nonlinear optics, in order to construct the desired active 
and non linear devices. Double titanium and magnesium 
diffusion was considered by the authors [1-2] to enhance 
the performance of those devices having an operation 
principle based on Type I interaction (i.e. on 
birefringence). However, periodically Poled Lithium 
Niobate waveguides’ (PPLN) are more feasible and 
attractive solution to achieve higher levels of efficiency in 
cascaded second order susceptibility. Their employment 
permits the interaction of the involved propagation for long 
propagation distances and lower power sources. The 
gratings, fabricated via periodic modulation of the second 
order nonlinear elements, are particularly attractive because 
provide a number or advantages; the main of which are the 
selective choice of the interacting modes, the negligible 
walk-off among the propagation modes and, at last, the use 
of light beams having exclusively TM polarization in Z-cut 
crystal. In fact, in this case, the basic element of nonlinear 
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tensor for frequency conversion is d33 larger with respect 
d31, pertaining to the case in which both TM and TE 


polarized modes are involved. Another favorable point of 
view is that the gratings constructed by means of the poling 
technique are integrable with other waveguiding structures 
on the same substrate. 

In this work a high performance, double pass 
version of PPLN converter is proposed. An integrated 
metal mirror, placed after a phase shifter made of a 
waveguide segment, provides the enhancement of the 
cascaded second order susceptibility process. In other 
words, thanks to the mirror and the phase shifter, the phase 
shift among the guided modes is improved before the 
second pass of interaction. Moreover, the electrooptic 
effect, i.e the application of a static electric field on the 
segment region, is utilized to obtain a fine variation of the 
this phase shift and a further optimization. The time 
domain analysis, by using gaussian pulses in time domain, 
confirms the results obtained in the stationary investigation 


[3]. 


II. THEORY 


The device is constituted by the channel, PPLN, Z-cut 
waveguide shown in Fig.1. The waveguide is obtained via 
Proton Exchange (PE) technology (H:LiNbO3 
waveguides), thus only the extraordinary, TM polarized 
waves propagate because the ordinary refractive index 
change at the surface is negligible. In this case, as d33 is 
the dominant non-linear coefficient and with reference to 
the coordinate system of Fig. 1, the second order 
polarization is easily carried out, being 
P,, =2d33 E, (@,)E, (@}), where Pj are the components 
of the induced polarization, Ej are the components of the 
applied electric field and d33 is the second order nonlinear 
coefficients; @g and @p are two arbitrary frequencies. A 
grating is employed in order to obtain the phase matching 
between the fundamental and the second harmonic waves. 
It is constituted by alternatively inverted ferroelectric 
domains and has a rectangular profile which can be 
expressed via the Fourier series. The first order QPM 
condition between the fundamental and the second 
harmonic waves, for a waveform having duty-cycle equal 
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xr 
As sh’ MIRROR 


Fig. 1 Sketch of the wavelenght converter 


to 50%, leads to the calculation of the Fourier 
coefficients: C4; = cee oe The operation principle of the 
T 


investigated wavelength converter is here briefly described. 
The strong pump wave at fundamental wavelength Ap is 
launched in the input section; the QPM provides the 
generation of the second harmonic wave with the same 
polarization, at the half wavelength Ash = Ap/2. The 
Difference Frequency Generation (DFG) between the 
generated second harmonic and the pump waves produces a 
new fundamental wave. In the same way, the input signal, 
which has wavelength As = Ap + Adp slightly different 
from that of fundamental Ap interacts via DFG with the 
second harmonic producing the new converted signal which 
is the desired output, at wavelength: 
No =WU/Agy -1/A,). 

In the first pass of interaction, from the input section 
z=0=Z to the end of grating z = 2G (zy — Z ), the forward 
waves interact nonlinearly. The mirror, is integrated onto 
the substrate at the end face of the channel waveguide, at an 
opportune distance zy from the end of the grating. It is 
noticeable to observe that the waves propagate along the 
additional optical path, from the end of the grating z = zG 
to the mirror z = zjyq (ZG — Zy,) and, after the reflection 
from the mirror, to the grating (zg < Z),). In this way, a 
suitable choice of the initial condition for the second pass of 
interaction is obtained by calculating the more opportune 
length AzGM = ZM — ZG, i.e. the distance of the mirror from 
the grating. After the reflection on the metal surface, in the 
second pass of interaction (zg < Zg ), the reflected waves 
continue their power exchange. 

The coupled equation system, which models the nonlinear 
interaction, is carried out via the conventional coupled 
mode theory. The electromagnetic field is expressed as the 
superposition of the e.m. field profiles of the modes of the 


waveguide [1-3] at the different wavelengths. The pump, 
signal, converted signal and second harmonic wavelengths 
are indicated with Ap, As, Ao, Ash, respectively. The 
perturbed field amplitudes ap(z), as(Z), Ao(Z), ash(Z) are 
linked by a system of four differential equations. The time 
domains analysis is performed by considering the 
perturbated field amplitudes in the nonstationary case and 
re-calculating the equation system in the coordinates € =z 


and t= t-(z-B,), i.e in a frame moving with the group 


velocity of the pump wave. The obtained system is 
integrated via the Method of Lines (MoL). The modes of 
the diffused channel waveguide have been calculated via the 
SEIM (Simple Effective Index Method). The eigenvalue 
equation of each slab is obtained by imposing the continuity 
of the tangential components of the e.m. fields at the layer 
interface and with the help of the transformation matrix for 
the diffused region: the transformation matrix is 
numerically evaluated via the Gear-Predictor-Corrector 
Method (GPCM). For the single pass operation, a good 
agreement with the literature data has been found with equal 
physical and geometrical parameter values of device [4]. In 
the novel double pass converter proposed in this work, the 
initial condition for the second pass of interaction, depend 
on the length of the additional segment waveguide. The 
simulation is performed under the hypothesis that the waves 
are perfectly reflected by mirror and that the four waves do 
not interact via the cascaded second order process in the 
region out of the grating. 


QI. SIMULATION RESULTS 


The proposed device is constituted by Proton Exchanged 
(PE), post-annealed, lithium niobate channel waveguide 
(H:LiNbO3). The construction parameters are: temperature 


Tp= 200 °C and time tp=14 h of the exchange process; 
temperature Tg =350 °C and time tg=13 h of the thermal 
annealing. The extraordinary and ordinary refractive index 
profile ng and no, are semigaussian along the x-coordinate 
and gaussian along the y-coordinate; neg=2.1380 and 
Nos=2.2112 are the extraordinary and ordinary refractive 
indices of the substrate at the fundamental wavelength Ap = 
1.55 um; neg=2.1789 and nos=2.2586 are the extraordinary 
ordinary refractive indices of the substrate at the second 
harmonic wavelength Ash=0.775 um; Anes = 0.015 and 
Anos=0 are the extraordinary and ordinary refractive index 
changes at the surface. The in-depth and the lateral 
diffusion lengths of the waveguiding channel are d = 3.45 


tum and w = 1.5 um, respectively. The nonlinear second 
order coefficient is d33 = 31 pm/V. The case of perfect 


quasi-phase-matching between the fundamental wave and 
the second harmonic, gives a theoretical grating period 
A =16.8624 tm, by taking into account that Neffp=2.13961 
and neffsh =2.18557 are the calculated refractive effective 
indices of the rectangular waveguide at the fundamental 
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Fig.2 (a) Evolution of pump and of second hatmonic pulse in time 
domain. 


and second harmonic wavelength, respectively. The 


calculated stationary efficiency 
conversion 

2 2 
Np = 10log(ja, Zo >ZM.2Zm 20) LE (0) ) versus 


the AzGm length, for a channel waveguide zG=20 mm, 
2 
pump power la, (0) =400 mW at the wavelength Ap = 


1.55 wm and signal power la; @|°=1.5 mW at the 
wavelength A, =1.56 um eshibits a periodic behaviour. The 
maxima of efficiency lie at the abscissae AzGM max n 
=(2n+1) A/4, n=0,1,...., Moreover, if the mirror is placed 
directly at the end of grating, ie. for AzGnq=0, the 
conversion efficiency is lower than that of the single pass 
operation. The calculation performed in time domain, by 
using input gaussian pulses having Full Width at Half 
Maximum FFWHM =2.5 ps has lead to results very close 
to those obtained in the stationary analysis [3]. The 
conversion efficiency behaviour can be easily explained by 
considering that, forthe second pass, by changing the 


la, \? [mw] 


la, |? [mw] 


25 ® 
15 f 
¥ 


S [om] 


Fig.2 (b) Evolution of put and of converted pulse in time domain. 


distance AZGM, the initial conditions of the cascaded 
second order process significantly vary according to the 
period A. In other words, it is due to the periodic phase 
shift among the waves before the second pass of interaction 
obtained by varying the distance of mirror from the grating. 
Figs. 2 (a), (b) depict the evolution of the powers of 


the involved waves law e} , (w = p, i, 0, sh) versus the 


interaction length, calculated for one of the suitable distance 
from the grating to the mirror AzGyq =12.5 um, the pump 


2 
power isla, (0) =400 mW while the signal power at the 


wavelength A, =1.56 pum is la; (0)|7=1.5 mW. Both the 
pump and the input signal are pulses having full width at 
half maximum FREWHM =2.5 ps. In this case, a strong 
depletion of the pump wave occurs; this allows the growth 
of the other waves, their enhancement continues along both 
the forward (F) zy > zgand the backward (B)zp <- ZG 
pass. This is the case of the optimum phase shift among the 
waves. The group velocity of each wave is calculated via 
the Sellmeier equation. The stationary conversion efficiency 
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Ndp versus the signal wavelength A, for the double pass is 
calculated for different values of the grating length. The 
simulated curves are flat in the wavelength range 
1.52+1.58 jum, close to the fundamental wavelength. In 
particular for zG =25 mm, in the second maximum of the 
conversion efficiency AzGmM =12.5 um. the maximum of 
the stationary conversion efficiency is Ndp=22.86 dB, for 
the optimized double pass devices, i.e. almost the triple of 
Nsp= 8.20 dB, which is the value calculated for only the 


single pass. Hovewer, the conversion efficiency, in the case 
of unperfected QPM is lower with respect that calculated for 
the theoretical value A. The conversion efficiency, for a 
single pass device, having a grating length zG=20 mm, 


2 
fundamental power la, (0) =400 mW at the wavelength 


Ap=1.55 pm and signal power la, (0)|” =1.5 mW at the 


wavelength 1,=1.56 um is calculated as a function of A, it 
decreases from 5 dB to -14 dB by varying the grating period 
in the range A=(16.862 + 0.04) um, from the theoretical 
value A=]6.862 pm. An electrooptic refinement of the 
phase shifth of the interacting modes is proposed as a 
compensating technique for correcting the construction 
faults. Two electrodes are placed on the segment from the 
end of the grating to the mirror, i.e. from zG to zM, in the 


yz plane. By applying a static electric field Ed to the 
electrodes a variation of the impermeability tensor 


Ab;; occurs, according to the formula Ab;; =D By. 
ae 


where Ti are the electrooptic coefficients of the Pockels 
effect at constant strain uj. The variation of the dielectric 
tensor €j; Causes a variation of the optical paths and of the 


phase shift of the waves. The results are shown in Fig. 3 
which depicts the variation of the efficiency ngp versus the 


applied electric field Ee for the double pass version. The 
dashed curve (scale at left) are calculated for the electrode 
length Le =AzGm = 4 mm while the solid curve (scale at 
right) for electrode length Le = AzGM = 5 mm; the 
electrooptic coefficient is 133=30.8 pn/V; the signal 
wavelength is Ag = 1.56 um and the perfect quasi phase 


matching is supposed. It is worthwhile to noting that a 
variation equivalent to that illustrated by the dashed curve 
could be obtained by changing the distance from grating to 
mirror AzGy from 11.8 pm to 13.6 um. 


IV. CONCLUSION 


The use of an integrated metal mirror placed at the end of 
an additional waveguide segment strongly improve the 
cascade second order process in a PPLN waveguiding 
structure. In fact, for a grating length zG4= 25 mm, the 


CONVERSION EFFICIENCY ndp (48) 
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Fig. 3 Variation of the efficiency ngp versus the applied electric field hag 
for the double pass version. 


calculated double pass maximum efficiency is enhanced 
from 1sp= 8.20 dB to ndp=22.86 dB. These results are 
close to those obtained in time domain analysis. To reduce 
the mismatch between the fundamental and second 
harmonic waves the the electrooptic effects is exploited.. 
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Demonstration of a Fabry-Perot active switch 
by the bidirectional MoL-BPM 


Antonella D’Orazio, Marco De Sario, Vincenzo Petruzzelli, Francesco Prudenzano, Leonardo Sassi 


Abstract—Aim of this paper consists of investigating the 
switching behaviour of a Fabry-Perot (F-P) active switch by 
means of an improved Beam Propagation Method based on the 
Method of Lines (MoL-BPM). The contribution of the 
longitudinal charge diffusion and the spreading of the injected 
current in the active region is considered, too. The reflection of 
the optical signal on the terminal facets is accounted by means 
of the bidirectional version of the MoL-BPM. The F-P switch 
length is only 200 tm, i.e. less than a fifth of the corresponding 
Traveling Wave (TW) switch (1100 um). 


Index terms—Fabry-Perot, rate-equation, charge-diffusion, 
crosstalk. 


I. INTRODUCTION 


New active waveguides, made of  chalcohalide 
praseodimium (Prt ) doped glasses! justify the considerable 
interest in the simulation of active devices. Under a 
financial support of Italian CNR _ project, we are 
demonstrating the feasibility of Prt doped glass 
waveguides made by pulsed laser deposition and when we 
will have the complete parameter set of chalcohalide glass 
waveguides we will be able to apply to these new and 
intriguing materials the proposed methodology which is, 
here, devoted to semiconductor devices. In fact, over the 
past years various models were introduced for the design of 
lasing semiconductor devices in order to account the 
intricate phenomena stemming from the current injection in 
an active medium.23 Substantially, the interaction between 
the electromagnetic field and the carrier concentration was 
solved in a self-consistent way as a boundary condition 
problem in the transverse section of the waveguiding 
structure. In this way, the interaction along the longitudinal 
direction, such as the influence of the mirror on the field 
propagation and any effect of the optical power on the 
propagation losses, was disregarded. On the other hand, 
with the aim of taking into account the longitudinal 
phenomena, recently several authors used the conventional 
coupled-mode theory for the design of active twin guide 
semiconductor devices both under electrical and optical 
control.4> This approach allowed for developing a model 
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for the active twin guide able to include the effects of the 
resonant cavity in the F-P version of the active switch, and 
for analyzing the coupling behaviour as a function of all the 
geometrical and physical device parameters, taking into 
account even complicate effects, such as the diffractive 
coupling between the optical fields, the gain saturation and 
the dispersive nonlinearity of the active layer. However, the 
coupled-mode formalism suffers the approximation of weak 
coupling, whereas short interaction length L could be 
obtained only by increasing the coupling coefficient C, the 
CL product being equal to 7/2. Therefore, in the cases of 
guides out of synchronism or strictly close each other, 
improved coupling theories have to be applied.®-7 A more 
appropriate approach to model the behaviour of active 
devices taking into account the interaction along the 
longitudinal direction and, then, to evaluate the evolution of 
the electromagnetic wave interacting with the active matter, 
makes use of the Beam Propagation Method based on the 
Fast Fourier Transform (FFT-BPM). First results were 
reported in reference 8 for the modeling of static double- 
heterostructure lasers. More recently the authors have 
investigated by means of the BPM based on the Method of 
Lines (MoL) the performance of an InGaAsP/InP twin guide 
as TW switch under electrical? and optical!® control. Owing 
to its semi-analytical approach, the MoL-BPM algorithm 
gives more accurate results with respect to those of the 
FFT-BPM. Moreover, a non-uniform discretization step 
along the transverse dimensions can be chosen by the MoL- 
BPM code, thus allowing a more rational use of the 
computer memory with a substantial reduction of the CPU 
computer time. We found good performance for switch 
length L=1100 um under electrical control and L=1485 um 
under optical control. In this paper, we apply the MoL-BPM 
algorithm to investigate the performance of a F-P active 
twin guide switch under electrical control. To this aim, the 
bidirectional version of MoL-BPM algorithm (MoL-BBPM) 
has been implemented in order to account both the forward 
and the backward waves in the charge density rate equation, 
so the influence of the optical feedback due to the presence 
of the terminal facets is evidenced.!!+!?. 


II. ANALYSIS 


To simplify the three-dimensional waveguide problem, we 
apply the refractive effective index method to the twin guide 
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coupler of Fig.1, in order to eliminate the x variable. In this 
way, the rate equation pertaining to the charge density 6 
assumes, for a stationary analysis (6 /6t=0), the following 
expression: 


E,(y.z)” +Ao(y.z)+Bo7(y,z)+Co'(y,z) (1) 


Fig.l: Twin guide switch. 


where: D, is the charge effective diffusion coefficient; J 
is the injected current density; dy is the depth of the active 
layer; g(y,z)=a O(y,z)-b is the approximated value of the 
linear local gain; h is the Planck constant; E,x,z) and 
E,(x.z) are the forward and the backward electric fields, 
respectively, calculated by the MoL-BBPM and normalized 
in [W/cm]!/2. The last three terms in Eq.(1) are the non 
radiative, radiative and Auger recombination effects, 


respectively. The physical data used in Eq.(1) are reported 
in reference 9. 


The TE-like electromagnetic wave is calculated in both the 
longitudinal forward and backward directions by the 
following expression by means of the MoL-BBPM: 
37E, 
dy? 
with n,(y,z)=N,_(y.Z)+B.0(y.z)+j(g/2k9), where ny is the 
passive refractive index o: the original structure without 
charge injection and 8, is the antiguiding coefficient which 


is negative for the InGaAsP/InP heterostructure at A=1.55 
um. ; 


+k n?(y,z)-B° |E, = (2) 


The MoL-BBPM algorithm is based on two operators and 
works in two steps. The first operator describes the one-way 
propagation (the same of the classical BPM), while the 
latter determines the reflection from a generic discontinuity 
along the longitudinal direction z, thus allowing the 
application to more complex laser structures such as DFB, 
DBR, and so on. In the first step these two operators are 
used in the forward propagation to calculate the transmitted 
and the reflected fields at each discontinuity along the 
structure. Successively, in the second step, the backward 
propagation is implemented and the new transmitted and 
reflected fields are evaluated, accounting the reflection 
contribution calculated in the previous step and summing at 
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each computational longitudinal section all the forward and 
the backward waves. This procedure is applied iteratively 
till to obtain a good convergence. The convergence criterion 
is based on the comparison between the field at the output 
section calculated in the two successive iterations. In each 
discontinuity section along the longitudinal axis z, the MoL- 
BPM code assumes for the wave equation along each line 
the following expression for the vector of the scalar 
potential g, in the transformed domain: 

S% 13.5, =0 (3) 


a 


dz 


By analytically solving Eq.(3) we obtain: 


@@=he "72 +B,e"” (4) 
where F, e*? and B, é? are the contributions of the 
fields in the transformed domain along the forward and the 
backward directions, respectively. Zk is the normalized 
longitudinal axis. The field 9,(Z) in the original domain 
can be obtained by the inverse domain transformation. In 
order to obtain the reflection operator, Eq.(3) is solved 
analytically for the forward and backward traveling waves 
in both the left I and the right II regions of Fig.2, that 
sketches the generic discontinuity section at z=0. 


Region I Region II 


z=0 
Fig.2: Forward and backward waves at the discontinuity 
interface. 


After having calculated the coefficient F; by Eq.(4), the 
unknown coefficients B,;, Fy and By are evaluated by 


matching the electromagnetic field transverse components at 
the discontinuity interface z=0. 


The simultaneous solution of Eqs.(1) and (2) solves the 
propagation problem of the active twin-guide, thus allowing 
the calculation of the switching performance in terms of 
crosstalk and gain at the output ports 3 and 4 of Fig.1. 


This model accounts also the effects of the charge 
longitudinal diffusion Gora? - 0) and of the spreading of 


the injection current in the active region. For evaluating the 
first effect Eq.(1) is solved as regards the longitudinal o 
variations by means of a finite difference scheme and by 
using an iterative procedure with successive 
approximations. The method converges after only twenty 
iterations. Moreover, for the second effect we assume in 
Eq.(1) a non uniform distribution of the current density in 
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the active region with a constant value under the electrode, 
very small but different from zero out of this. 


III. NUMERICAL RESULTS 


The twin guide is characterized by the following physical 
and geometrical data: ridge width w=2.4 um, ridge height 
d,=4.6 um, active layer depth d,=0.2 tim, cladding depth 
d,=0.2 um, waveguide gap s=2 tm, refractive index of the 
cladding and the substrate n.=n.=3.17, refractive index of 
the active layer ng=3.38 at the vacuum wavelength A=1.55 


tum. In order to establish the appropriate values of the 
injection currents I,;4=I,=1, and I,,=I,=I, for the switching, 
a preliminary analysis has been carried out for one of the 
two ridge waveguides of Fig.| acting as laser F-P. In fact, 
this analysis is necessary to calculate the threshold current 
value for the laser effect. To this purpose we calculate by 
the MoL-BBPM the laser power-current characteristic curve 
by changing the current from 20 mA to 870 mA for a F-P 
laser length L=200 um. In our calculations we have choose 
equal reflectivity p=0.3 for the two terminal facets and a 
longitudinal MoL-BBPM calculation step Az=0.5 pum. 
Moreover, a non-uniform transversal discretization is used: 
Ay=0.01 tm and Ay=0.5 um for the inner window 20 um 
large and for the outer calculation window 30 um large, 
respectively. Fig.3 shows the power-current characteristic of 
the F-P laser with length L=200 um. We see that the laser 
effect occurs for an injected threshold current I,,=30 mA. 
In fact, as expected, for higher current values the output 
power linearly increases. Moreover, we verify that the 
presence of both the ridge waveguides does not give a 
change of the I,, value. For this reason we choose for the 
two injected currents I, and I,,4 of the twin guide switch of 


Fig.l, values around the threshold one. In particular, for all 
the MoL-BBPM simulations, the sum I7=I,3+1,, of the two 
currents is constant and equal to 60 mA, while changing the 
T ratio: 
paetoa = bs 

ly 
In fact, for I; greater than 21, the gain is so large to have a 


laser behaviour. 


Figs.4 and 5 depict the MoL-BBPM calculated crosstalk 
CT=10 log(P,3/P,4) and the gain G=10 log(P,q/P;,) as a 
function of the current ratio r for a switch length L=200 um. 
P;, being the optical power launched in the input port 1, P,3 
and P.4 are the output powers emerging from the port 3 
and 4, respectively, while Py is equal to P,3 or Pq for 
the bar state or the cross state configurations, 
respectively. More precisely, Fig.4 shows the calculated 
CT values for r ranging from —0.05 to 0.067. We see that 
for r within the intervals -0.035+-0.027 and 0.017+0.067 
the CT is positive because the switch assumes a_ bar 


state configuration (CT), whereas for the other r values the 


crosstalk is negative and the F-P switch is in the cross state 
(CT,,). However, we find that the best CT), occurs for 


t=-0.035+-0.028 and r=0.042, while the best CT.,occurs 
for r=-0.025+0.005. 


PB, mW] 


I ma 


Fig.3: Power-current F-P laser characteristic for L=200 um 
calculated by the MoL-BBPM. 
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Fig.4: Crosstalk of the F-P switch as a function of the 
current ratio r. 
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Fig.5: Gain G of the F-P switch as a function of r: bar state 
configuration (dashed line), cross state configuration 
(solid line). 
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On the other hand, Fig.5 depicts the pattern of the gain G 
for the two switching configurations: G,, for the bar state 


(dashed line), G., for the cross state (solid line). We can 
observe that in the range of better crosstalk (r=-0.035+- 
0.028) for the bar state configuration the gain G,. is about 
equal to 9 dB, while for r=0.042, where CT,,.=12.4 dB, G,, 
assumes the maximum value equal to 12.3 dB. Therefore, 
this last configuration of the currents 113 and I54 (r=0.042) 
shows the best bar state behaviour. Moreover, in the r range 
of the best CT,, values (r=-0.025+0.005), G,, decreases 


from 12 dB to 9.6 dB. However, in summary, the best cross 
state configuration occurs for r=0 for which CT..=-20.9 dB 


and G,.=9.6 dB. In this case, in fact, even though the gain is 


not the maximum value, the crosstalk exhibits best 
performance. 


Figs.6 and 7 illustrate the evolution of the total field 
amplitude E=E;+E, along the propagation axis z calculated 
by means of the MoL-BBPM for the two best configurations 
of the bar state and the cross state, respectively. Fig.6 shows 
that, for r=0.042, the total bidirectional electric field 
assumes a Stationary wave pattern and the optical power 
launched from the port | emerges at the output section of 
the twin guide coupler from the port 3. 


E 
10 


Fig.6: Field evolution of the total field amplitude along the 
F-P twin ridge waveguide coupler for r=0.042: bar 
state configuration. 


Fig.7: Field evolution of the total field amplitude along the 


F-P twin ridge waveguide coupler for r=0: cross state 
configuration, 
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On the other hand, in Fig.7 it is evident that, for a different 
current ratio value (r=0, I,3=I,4=30 mA), the launching 


field emerges from the other port 4 assuming a cross state 
configuration. 


IV. CONCLUSION 


We have simulated the switching performance of a F-P twin 
guide coupler. In fact, the bar state has been obtained for 
r=0.042, that is I,,=28.74 mA and I,,=31.26 mA, with a 


crosstalk CT,.=12.4 dB and gain G,,=12.3 dB. Moreover, a 


good cross state configuration has been found for r=0, that 
is I,3=I,4=30 mA, with CT,.=-20.9 dB and gain G,.=9.6 


dB. The advantage of the F-P active switch, with respect to 
the analogous TW switching configuration, has been put in 
evidence with a drastic reduction of the device length from 
about 1100 um to about 200 um. 
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FDTD SIMULATION OF ALL-OPTICAL 
SWITCHING USING DIFFERENT WAVELENGTH 
CONTROLLING PULSE IN THE NLDC 


Natasa Trivunac-Vukovic, Member, IEEE, and Bratislav Milovanovic, Member, JEEE 


Abstract-- Finite difference time domain method (FDTD) is 
proposed for solving the nonlinear coupled Schrédinger 
partial differential equations, which describe the soliton 
propagation in the nonlinear directional coupler (NLDC). 
Ultrafast all-optical switching using NLDC and solitons is 
described. The switching is controlled by copropagating a 
different wavelength controlling soliton pulse in the adjacent 
fiber. 


Index terms-- finite difference time domain method, nonlinear 
directional] coupler, controlling solitons 


1. INTRODUCTION 


The application of the NLDC and optical solitons in 
ultrafast all-optical switching has been widely investigated 
[1], [2], [3], [4]. All-optical switching devices can operate 
at speeds larger than conventional electronic or 
optoelectronic devices, ‘and from that reason they are 
considered to be very attractive. The NLDC is a device 
with four optical ends which is usually constructed of two 
coupled optical fibers. The cores of those two fibers are at 
the determined length so they can exchange energy. The 
coupling length can be chosen so that a low intensity signal 
can be completely coupled from one core to another after 
travelling a certain length. They are usually asymmetric in 
a way that the nonlinearity effect is stronger in one optical 
fiber. In such an asymmetric coupler, the output power 
appears in the fiber with stronger nonlinearity effect when 
only one fiber is excited. When both fibers are excited, the 
power distribution at the output end depends on the relative 
phase of the input signals. If a high intensity signal is 
launched into one of the cores, the propagation constant is 
changed in that core due to the nonlinear dependence of 
refractive index or intensity [2], [3]. From that reason the 
two cores become uncoupled because of the phase 
mismatch. Since the switching is controlled by the 
instantaneous value of the intensity, the low and high 
intensity parts of the same optical pulse will be directed to 
different output ports and that can seriously degrades the 
performance of the NLDC. It has been presented [4] the 
solution of this problem using controlling optical solitons, 
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which can propagate with a signal soliton while its shape 
remains unchanged. The main function of the controlling 
solitons is to provide that NLDC does not reroute the signal 
into different port and in that way the controlled switching 
is obtained. In Fig. 1 the schematic block diagram 
describing all-optical switching using NLDC and 
controlling solitons is shown. 


mESSICSE ee S 
Pulse DSF ee ey Ts Optical | 
Source Vip | Receiver | 


S$ DSF 


NLDC 


Fig. 1. Schematic block diagram of all-optical switching 
using NLDC and controlling solitons (DSF - dispersion 
shifted fiber, SS - signal soliton, CS - controlling soliton) 


The optical soliton propagation in the NLDC can be 
described by a pair of coupled nonlinear Schrodinger 
differential equations. The aim of this paper is to present 
an efficient and simple procedure for solving the nonlinear 
coupled Schrédinger differential equations based on FDTD 
method, which provides observation of all-optical 
switching using NLDC. FDTD simulation of soliton 
propagation in the optical fibers has been already shown 
[5]. This method can be used to transform the 
initial/ooundary value problem associated with the 
nonlinear Schrédinger equation, into a first order, linear, 
initial value problem. Four different switching schemes 
where controlling solitons may be differently polarized 
(orthogonal polarized), and when they have different 
wavelength have been already analyzed [6] and it has been 
shown that the best switching performance could be 
obtained using different wavelength controlling pulse in 
the adjacent fiber of the NLDC. Due to that reason, this 
switching scheme will be analyzed in details, with respect 
to the value of the amplitude of the controlling pulse, the 
chosen wavelength of the controlling pulse and the width 
of the controlling pulse. 


Il. THEORETICAL MODEL 


The optical soliton propagation in the NLDC can be 
modeled by a pair of coupled nonlinear Schrodinger 
equations in the normalized parameters [7], [8]: 
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where u,,(&,7) are the envelopes of optical pulses in both 


channels (fibers) of the NLDC, which are functions of 
distance and time, & is normalized distance, Tt is 
normalized temporal coordinate, k, is a coupling coefficient 
and q is a parameter which describes the asymmetry of the 
NLDC (we will assume that g=1). Those parameters are 
defined as follows: 
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Ais a wavelength of the optical carrier, k is a wave number 
in the optical fiber. The normalized coupling length is 
defined as follows: 
1 
iS Cf, 5) k. 
The switching scheme that we observe, where the 
controlling pulse has a different wavelength and the same 
polarization as the signal pulse, is described by four 
coupled equations, as follows: 
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u;,2 and v, represent the signal and controlling pulses, 
respectively. The parameter 5 is the group velocity 
mismatch due to the difference in the central frequencies of 
the pulses. The fifth terms account effect of cross phase 
modulation. D,, and D, are the dispersion parameters of the 
optical signal mode u and optical control mode v. The 
frequencies are chosen to be in the different dispersion 


regions, positive and negative, so the third term in the Eggs. 
(9) and (10) which corresponds to controlling pulses, has 
negative sign. The losses and third order effect are 
neglected in this analysis. 


lll. ANALYSIS METHOD 


The Eqs. (1) and (2) and (7) — (10) have to be solved 
numerically. The efficient way for solving the coupled 
nonlinear Schrodinger equations by finite difference 
method has been proposed [5]. The first step in the analysis 
is the substitution of the variables x= and f=r. Time 
domain discretization is made with step Af, where the 
corresponding index is n, so that nAt=t (n=0,1,2,...). 
Similarly, index i corresponds to the discretization in 
space, so iAx=x (Ax is a mesh resolution, i=0,1,2,...1). 
Discretized functions w)(x,t), u2(x,t), vi(x,)), v2(x,t) are 
denoted aS Win, U2in, Viin and v2,, respectively. The 
forward Euler finite difference scheme is applied. After 
applying the finite difference method, the implicit form of 
the Eq. (7) is given: 
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The equations (8)-(10) can be derived analogously. For the 
purpose of computer simulation, all equations should be 
explicit with respect to w)i+).n, U2i+1.n) Viitin ANd V2i+1 9. 


IV. SIMULATION RESULTS 


The switching performance of the NLDC will be 
numerically investigated by FDTD method. The von 
Neumann stability analysis was carried out and the 
discretization steps are determined. The mesh resolution is 
chosen to be Ax=0.002 and the time domain step Af0.2. 
The assumption is that 6=0 because of the simplicity of the 
analysis, and coupling coefficient k=1, so the normalized 
coupling length is €&=7/2. In the case when the controlling 
pulse has a different wavelength and the same polarization 
as the signal pulse, for the case of dispersion shifted fiber, 
A, is chosen to be 1560nm and A, is chosen to be 1540nm, 
with the corresponding dispersion —_ parameters 
D,=\ps/(nmkm) and D,=-1ps/(nmkm). 

It has been shown [8], [9], in generally, the solution of the 
nonlinear Schrédinger differential equation in the form: 


u(x,t) = nsech(n(t—t)))-exp(jin?x/2) (12) 


where 7 is the scaling parameter. By using this closed form 
solution, it is possible to assume the initial pulse for x=0, 
as follows: 


u(0,t) = sec h(t —t)) (13) 
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The initial conditions for the switching scheme under 
consideration (for x=0) are: u,(0,t)=sech(t); u2(0,)=0; 
v,(0,t)=0; v2(0,t)=Asech(t). The input and output pulse 
envelopes after the coupling length will be shown. It was 
already mentioned that the function of the controlling pulse 
is to provide that the NLDC doesn’t reroute the signal to 
the other fiber as it would be if the controlling pulse were 
absent. From that reason, the pulse envelope at the output 
of the same fiber of the NLDC is observed. We will show 
that by using controlling solitons it is possible to obtain the 
unchanged shape soliton signal at the output of the same 
fiber of the NLDC. 

In Fig. 2 the normalized energy versus controlling soliton 
amplitude A at the output of the channel u,, is shown. This 
dependence describes the switching performance of the 
NLDC with respect to the controlling signal amplitude A. 
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Fig. 2. Normalized energy versus controlling signal 
amplitude A, at the output of the channel u, 


From Fig.2 it can be seen that the best switching 
perfomance can be obtained for A>4.2, We will choose 
Amax iS 6, What corresponds to the maximum energy 
(almost 90%) at the output of the NLDC. 


t 
Fig. 3. Envelopes of input pulse and pulse at the coupling 
length in channel uw; (when Aj,a:=6) 
a— input, b— at the coupling length 


In Fig. 3 the envelopes of input pulse and the pulse at the 
coupling length are shown, for this switching scheme when 
Amax=6. The signal has the envelope that coincides with the 
fundamental soliton inputted to the channel wu). 


Fig. 4. Soliton propagation in channel wu; (when A;max=6) 


The main function of the controlling soliton is 
demonstrated in Fig. 4. Controlling soliton provides 
propagation in the same fiber where the signal was 
launched. Signal soliton does not reroute to the second 
fiber when it was launched into the first fiber. This scheme 
is analyzed for other values of A, when AA pox. 


Fig. 5. Envelopes of input pulse and pulse at the coupling 
length in channel wu, (when A mar~A=3) 
a— input, b — at the coupling length 
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Fig. 6. Envelopes of input pulse and pulse at the coupling 
length in channel wu; (when A max#A=5) 
a— input, b — at the coupling length 


In Figs. 5 and 6, the signal envelopes, after the coupling 
length for A4=3 and A=5, are shown. It can be concluded 
that for different values of A, the signal at the output of the 
channel uw, has the soliton envelope and gives good results. 
By decreasing the amplitude value A, the switching 
performance are degraded. 
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This scheme was analyzed with respect to wavelength of 
the controlling pulse. 
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Fig. 7. Envelopes of input pulse and pulse at the coupling 
length in channel u; (when A,,a.=6, 2,=1530nm) 
a— input, b — at the coupling length 


Fig, 8. Envelopes of input pulse and pulse at the coupling 
length in channel uv; (when Aj,a:=6, 7,=1555nm) 
a— input, b — at the coupling length 


In Figs. 7 and 8 the envelopes of input pulse and pulse at 
the coupling length for chosen A,,2.=6, A,=1530nm and 
A,=1555nm, respectively, is shown. The wavelength of the 
signal soliton is chosen to be A,=1560nm. It is obvious that 
greater difference between signal and controlling soliton 
leads to greater envelope cegradation. 

The main drawback of this switching scheme is the 
necessity to have large amplitude controlling pulse, to 
obtain better switching performance. From that reason, we 
analyzed the influence of controlling pulse width on 
switching performance. In order to reduce the peak 
amplitude A of the controlling pulse we have increased the 
pulse width and in that way the same energy at the output 
could be obtained. The switching performance are analyzed 
for this scheme, with the controlling pulse Asech(wt), for 
values w=1;0.5;0.2. The smaller value w corresponds to the 
broader pulse. In Fig. 9 the switching performance versus 
amplitude A, where w is the parameter, is shown. 

From Fig. 9, the optimal values of A and pulse width w that 
corresponds to the maximum output energy can be 
determined. By increasing the controlling pulse width the 
controlling soliton amplitude can be reduced even to value 
A=2.8 (for w=0.2), keeping the maximum energy at the 
output. 
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Fig. 9. Normalized energy versus controlling signal 


amplitude A, at the output of the channel u,, w - parameter 


v. CONCLUSION 


A computer simulation, based on FDTD method, of 
controlled all-optical switching of a soliton signal to the 
output ports of 2x2 NLDC is shown. The routing is 
controlled by simultaneously propagating different 
wavelength soliton pulse in the adjacent fiber of the NLDC. 
This switching scheme has been analyzed with respect to 
the amplitude of the controlling pulse, chosen wavelength 
and width of the controlling pulse. The results show that 
the output signal is almost distortionless and the optimum 
amplitude can be obtained giving a very high switching 
efficiency (of almost 90%). Also, the analysis with respect 
to wavelength shows that greater difference between signal 
and controlling soliton leads to greater envelope 
degradation. As the way to reduce the peak amplitude of 
the controlling soliton what is the main drawback of this 
switching scheme, it is proposed the increasing of the 
controlling pulse width, while energy remains the same. 
FDTD method is an efficient tool for solving the nonlinear 
coupled Schrodinger differential equations and can be used 
for observation of all other effects of pulse propagation in 
nonlinear dispersive medium. 
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HARDWARE IMPLEMENTATION OF 
MULTIMEDIA DRIVEN HFC MAC 
PROTOCOL 


H.-C. Leligou, J. Sifnaios, G. Pikrammenos* 


Abstract-- Tree-shaped topologies present attractive cost 
advantages for broadband access networks by allowing 
many customers to share the expensive head-end 
equipment and the feeder section and provide a graceful 
upgrade path towards the photonization of the local loop. 
In addition they offer reuse of the copper last drops to the 
customer at least during the crucial introductory phase 
and probably for many years to come. Typical examples 
are Hybrid Fiber Coaxial (HFC) and Passive Optical 
Network (PON) systems. The TDMA multiplexing of 
traffic entering such a system is governed by the MAC 
protocol, which arbitrates the allocation of bandwidth to 
the shared feeder. At the same time the need to integrate 
telecom services presenting different quality requirements 
with plain best effort services over the same infrastructure 
brings new issues to the design of such an access 
mechanism. The MAC protocol as the only arbiter of the 
upstream bandwidth directly affects the QoS provided to 
each upstream traffic flow and must meet several 
constraints. Such constraints include the adequate speed of 
operation exploiting in the highest degree the speed of H/W 
implementation, flexibility to support efficiently the largest 
number of services and applications offering an adequate 
number of QoS classes and independence of higher layers, 
protocols and future extensions to traffic management 
specifications. The implementation of a MAC protocol 
targeting these goals in the framework of the AROMA 
research system is presented in this paper. We discuss the 
details and the implementation cost of the solutions 
followed and we evaluate the implemented mechanisms 
using computer simulation. 


Index terms—HFC, MAC, QoS, FPGA controller 


I. INTRODUCTION 


Hybrid Fiber/Coaxial (HFC) systems, re-using the 
existing infrastructure of CATV systems and their 
coaxial medium beyond the fiber node, reduce the cost of 
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the transition to a broadband access network offering a 
graceful solution in the cost-sensitive residential access 
domain. In such shared access systems, sharing the 
upstream channel (from customer to network) is usually 
effected through TDMA multiplexing. From a traffic 
engineering point of view, the downstream direction 
exhibits a broadcast nature while in the upstream, the 
traffic from all active modems is multiplexed/ 
concentrated as in the output port of any switching node. 
To arbitrate the access to the slotted common medium 
according to the demand, a Medium Access Control 
(MAC) has to be employed. 
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Figure 1. HFC system 


Bandwidth allocation to the system users may be static, 
based on a pre-assigned unsolicited number of upstream 
time slots, or on demand, based on a request-grant 
exchange protocol. Different standard organization [4], 
[5], [6] have proposed several mechanisms to support all 
possible bandwidth allocation schemes trying to optimize 
the trade-off between the flexibility and performance, 
and the overhead and complexity of the MAC protocol. 
The allocation of upstream slots in the AROMA system 
is based on a reservation method which allows to 
dynamically adapt the bandwidth distribution to traffic 
fluctuations. The implementation of mini-slots [8] in the 
physical layer in order to efficiently support contention 
mechanisms was not followed, since the focus of the 
project was in providing QoS guarantees and predictable 
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performance to differentiated services. Contention 
mechanisms for access to the upstream channel even in 
mini-slots cannot guarantee worst case performance and 
could optionally be employed as a companion 
mechanism for not time critical services [7]. Polling of 
requests was 

instead employed providing the opportunity to trade-off 
utilization for low reaction time and controlled delay. 
The MAC controller collects access requests and then 
allocates the upstream slots by sending access permits. 
This results in a distributed queuing system 
characterized by the long time required to pass control 
information from the queuing points to the service 
controller residing at the head-end. It is important to 
note that the service policy of the MAC governs the 
distributed multiplexing from a central point situated 
miles away at the head-end. This has the important 
implication that, as regards delay control, the acquisition 
of arrival information and buffer fill levels includes a 
considerable delay element not found in the centralized 
multiplexing typically operating in a switch queuing 
point. In contrast, drop policies must be distributed over 
all network terminations (cable modems) where the flow 
identity and fill levels are known. 


The need to differentiate the handling of traffic classes 
became apparent by the emergence of requirements for 
ISPs to offer QoS to certain flow in the presence of 
disturbing plain best-effort traffic without complex 
maintenance of state information in the core routers. It 
was obvious that it was not acceptable to mix delay- 
sensitive and best-effort services without unacceptable 
loss of grade of service for the former class, which 
employed preventive flow control. In contrast TCP/IP 
traffic uses flow control on the transport layer based on 
the detection of losses. Thus a TCP flow increases 
gradually the rate submitted to the network up to the 
point that buffer overflow occurs and re-transmission is 
triggered. This action will disturb the other traffic 
classes, which can not berefit from re-transmission. 


As the MAC protocol requires a significant hardware 
part, a flexible strategy that can deal with behavior 
aggregates, guarantee quality to demanding traffic and 
reduce complexity is needed. The basis of the approach 
presented is the use of access priorities in the reservation 
system. Logically separate queues for each priority 
connection are necessary in the cable modem side for the 
proper operation of the prioritization scheme. 


Services with a similar set of requirements can be 
grouped into 4 behaviour aggregates (classes) supported 
by 4 different priority connection: 

e All delay sensitive periodic Constant Bit Rate traffic 
is treated as the highest priority traffic (1" priority) 
and is supported by unsolicited permits scheduled 
during the connection establishment stage leaving 


the three lower priorities employ the ‘reservation 
scheme. 

e Real-time Variable Bit Rate flows, such as video 
services are grouped into the 2° priority of the 
presented algorithm. The 2° priority requests are 
inspected first, preventing a possible disturbance of 
the two lower priority traffic that also employ the 
reservation scheme. At the same time, a peak-rate 
policing mechanism guarantees that malicious users 
cannot disturb the system performance. 

e The third priority traffic class is suitable for services 
requiring a better than best-effort behaviour. A 
minimum cell rate is guaranteed for this priority, 
while traffic exceeding this rate is relegated to the 
4" priority. 

e Plain best-effort services are grouped into the 4" 
priority, relying their rate adaptation to their loss 
based flow control mechanisms at the TCP level. 

Prioritizing the different requirements connection 

classes, QoS can be guaranteed to the services that need 

it and TCP/IP services are supported taking advantage of 
the statistical bandwidth multiplexing without disrupting 
other services. In other words, if we did not have the 
lower class we would have either to accept less 
demanding- quality traffic or leave the excess bandwidth 
unutilized in an effort to avoid performance degradation 
during statistical extremes. It is the availability of closed 
loop congestion control of TCP that allows us to offer 
both guaranteed performance to demanding traffic and 
good system resource utilization, at the same time. This 
results to a bigger number of services support and a more 
efficient bandwidth exploitation. The presented approach 
is in line with both the different classes of service 
according to the ATM forum philosophy and the 
Differentiated Services strategy of the IETF [1], [2],[3]. 


II. IMPLEMENATION OF THE MAC CONTROLLER 


The MAC controller was implemented on the Access 
Network Adaptation (ANA) board of the AROMA 
system. Since the MAC layer is considered a sub-layer of 
the physical layer and is closely related to the physical 
layer functions (TDMA, framing, etc.), the permit 
generation function was selected to be integrated with 
the downstream framer component, which was 
responsible for the construction of the downstream frame 
including the transmission permits. The whole design 
was programmed and placed on a Field Programmable 
Gate Array (FPGA) chip. This decision made possible 
the re-design of the implementation to adjust to modified 
frame structures, rates etc. An embedded processor (On 
Board Controller) was used to calculate and modify all 
non-real time parameters such as the programmed (pre- 
allocated or provisioned) bandwidth distribution, which 
may vary with time due to the switch on or off of the 
modems. An external static RAM chip was used for 
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buffering necessary protocol information in addition to 
the available on-chip memory provided by the FPGA, 
which was exploited to keep state information and 
speed-up the permit generation process. The MAC 
algorithm works as follows: 

The OBC executes the Call Acceptance Control (CAC) 
scheme and prepares a list of 512 permits as well as a 
credit list. The 1* and the 2"¢ priority connection service 
at the constant and minimum bit rate respectively is 
supported by pre-allocated unsolicited permits written by 
the OBC at the time of the subscription in the permit list. 
The modems that have established such priority 
connections do not need to be polled since the 
programmed permits will enable the announcement of 
new arrivals at any queue. Polling permits are inserted 
for modems that have not established a 1" or 2" priority 
connection. A variable number of positions in the permit 
list are left empty. Techniques to space the permits in the 
list are given in [9]. At the same time to support the 2" 
priority connection service up to the peak-rate and to 
guarantee that the 3“ priority connections will achieve 
their guarantied minimum rate, credits are allocated and 
grants will be issued on request basis. Both the credit 
and the permit lists are stored on the external SRAM 
chip. Since ATM signaling is also supported in 
AROMA, the permit and credit lists can be updated 
dynamically to add new connections using a second 


copy. 


Figure 2: MAC design 


An empty position in the list (representing unallocated 
bandwidth) enables the dynamic service of the three 
lower priorities, which is based on requests. The 
requests per queue/priority arrive at the MAC controller 
in the head-end piggy backed in the upstream 
transmitted cells. The 1-byte wide piggy back field has 
been segregated in 3 parts, to offer the chance to all the 
three lower priority queues to announce the arrival of 
new cells, albeit with a delay equal to the time for the 
propagation of the information. This information is used 


to increment the 2 outstanding request totals for each 
modem and queue, which are stored in the 16 bit wide 
external SRAM creating a reflection of the 
corresponding buffer fill levels. Due to the employed 
round-robin service strategy, the piggybacked request 
buffer should be accessed frequently and traversed in 
order to find the next permit that will be scheduled, To 
avoid a redundant number of accesses to the external 
SRAM, which is time-consuming, and to expedite the 
search process, on chip flags are used to quickly detect 
and skip all empty locations. 

The permit generation process results in the construction 
of a field pre-pended in each downstream frame, which 
includes all the grants for the corresponding time 
interval. The permit list is cyclically read out by the 
MAC controller H/W. The scheduled permits are 
inserted in the permit field and the empty positions are 
filled with permits produced by an engine scanning in a 
round robin fashion the outstanding request counters and 
reducing them by one for each permit scheduled. The 
higher priority counters are inspected first and only if all 
are empty the same process is repeated for the 
immediately lower priority. The permit generation is 
subject to some additional checks and guarantees by 
force of a companion credit check process. The MAC 
H/W subtracts the credits as it issues permits and stops 
serving any modem queue that exceeded its allocated 
apportionment. This policing action guarantees that 
malicious users can not disturb complying traffic 
exceeding their contracted peak cell rate. The credit 
check is used differently in the 3“ priority: it guaranties 
a minimum cell rate service, since a programmed 
number of credits guaranties the existence of the 
corresponding unallocated bandwidth. When 3“ priority 
credits are exhausted, left-over requests are added to the 
4" priority ones and treated as plain best-effort traffic 
request, in accordance with AF rules. The operation of 
the permit scheduler and the execution of the MAC 
algorithm are depicted in Figure 2. 

It worth noting that the permits sent downstream 


- indicate only the modem and not the queue that is 


allowed to transmit. The modem is responsible for 
selecting the appropriate queue for transmission in an 
increasing order of priority, so that the higher priority 
traffic achieve a better performance in terms of delay. 


III. PERFORMANCE EVALUATION 


For the evaluation of the above MAC mechanisms a 
computer simulation model was created using the 
PTOLEMY platform for modeling the AROMA MAC 
function under certain loading conditions. The scenarios 
used 10 cable modems loaded with uniform traffic for 
each priority. Each source used a common ON-OFF 
model, generating traffic at the slot/cell level with 
exponentially distributed on and off periods. The peak 
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rate was the system rate simulating the arrival of IP 
packets with geometrically distributed lengths, which 
were instantaneously segmented into slots. 

The 10% of the bandwidth was provided by the list as 
necessary for the polling of new bursts and the rest 
equally distributed among the priorities and modems in a 
way that the total was the one shown on the horizontal 
axis. 

The 1* priority is not presented since it exhibits 
deterministic behaviour, as the transmission permits are 
pre-programmed in the list [7], and thus, the delay never 
exceeds the fixed permit distance. The most important 
result caused by the prioritization scheme is the delay 
advantage provided to the higher priorities. The 
transmission of almost all the 2" priority cells delay less 
than 250 slots (i.e. 43ms) while those of the 3% less than 
350 slots (60ms). Of course there is no bound for the te 
priority which can exceed any limit depending on the 
total loading. In real-life though, this would cause no 
harm to the TCP-based user applications, because the 
tates of the sources would adapt to the possible 
bottleneck sharing the spare capacity equally among 
them due to TCP layer congestion control tools. 
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Figure 3: Mean delay vs. total load 


At the same figure we can observe the behavior of the 
system when no priorities are available except the 
unsolicited permits of the 1“ priority, ie. the sources 
which were used for the three other classes are all 
feeding the same queue, marked “all” in figure 3. The 
total load is the same with 10% used for the 1* priority 
in all cases. The benefits of prioritization are clear since 
no traffic class can enjoy bounded delay in this case. In 
contrast, when priorities are enforced the 2™ and the 3" 
classes enjoy a seemingly lightly loaded medium. Only 
the last priority sees a performance reduction, which 
however is well equipped to handle using the TCP layer 
congestion control tools. 


IV. CONCLUSION 


Tree-shaped shared medium access networks such as 
PONs and HFC effect a distributed multiplexing function 
which concentrates traffic from many users and many 
services with diverse requirements. The system thanks to 
prioritization can guarantee the QoS required by 


sensitive traffic while exploiting any umreserved 
bandwidth for the support of best-effort traffic. The 
DiffServe architecture is quite advantageous in this 
context and a MAC protocol taking advantage of this 
approach was implemented and evaluated with 
satisfactory results. 
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The Need for Wavelength Converters in All- 
Optical Transport Network 


Bostjan Batagelj, Student Member, IEEE 


Abstract--This paper describes the need of wavelength 
conversion in all-optical transport network. It focuses on the 
comparison of WP and VWP in terms of non-hierarchical all- 
optical network with blocking and with waiting queue. I am 
studying the system, where the requests for free wavelength 
path arrive randomly and I am assuming that it has Poisson 
distribution. The calculations of the blocking and delaying 
are made, and are shown in the traffic diagrams. This paper 
presents the benefits of wavelength conversion in WDM 
optical network. It turned out that VWP has a larger routing 
freedom than WP in both analysed systems. Clearly, the use 
of wavelength conversion reduces the blocking or delaying 
probability for a given utilization and size of a cross-connect. 
The results of our study show that the VWP with waiting 
queue can significantly improve the traffic performance of 
the network. 


Index terms--Optical network, wavelength converters, optical 
path 


I. INTRODUCTION 


The goal of all-optical network is to transport information 
between two district places. The capacity requirements for 
the nowadays all-optical transport network are boosted by 
the increase in number of users connected to the network, 
the bandwidth demand and the increasing usage of the 
services. Due to an increase in required capacity, the 
network should allow the transport of high bitrates. 

The evolution of all-optical network has begun by 
transporting single wavelength through each fibre. The 
next step (which is currently happening in many countries 
worldwide) was to improve the transmission efficiency by 
using Wavelength Division Multiplexing (WDM). Here 
different signals are multiplexed together and transported 
through a single fibre. So-called Wavelength Paths (WP) 
are established. A further step is to increase capacity by 
upgrading such WDM all-optical transport network with 
Wavelength Converters (WC) [1] that allow Virtual 
Wavelength Paths (VWP). 
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As we can see in Fig. 1, two different approaches 
supporting optical paths are possible depending on whether 
the signal is carried by the same wavelength along its 
travel throughout the network (WP) or is converted to 
another wavelength in the midstream [2] (VWP). The 
realisation of VWP requests wavelength converters. 


SOURCE SOURCE 
Ww) W\ 
WP: Wy VWP: W2 


y 
@xe) 


DESTINATION 


@) 
@ys 


DESTINATION 
Fig. 1. Optical WP and optical VWP. 


This paper focuses on the comparison of WP and VWP in 
terms of non-hierarchical all-optical transport network 
with blocking and with waiting queue. The calculations of 
blocking and delaying are made, and they are shown in the 
traffic diagrams. The benefits of wavelength conversion in 
WDM are presented. Therefore, it is shown that there is 
always a wavelength conversion gain. [3] 


II. TRAFFIC THEORY IN ALL-OPTICAL WDM NETWORK 


The wavelength cross-connecting will be implemented in 
optical cross-connect (OXC) and optical add-drop 
multiplexers (OADM). All OXC and OADM. are 
considered equivalent as far as routing of the traffic is 
concerned, Adaptive wavelength routing [4] in the non- 
hierarchical type of routing exploits the available capacity 
of optical network more efficiently than routing in the 
hierarchical system. In fact, the adaptive routing can 
function at lower capacities when overloading of the 
transport network is maximal. The disadvantage of the 
adaptive routing lies in the fact that it needs complex 
computer management in network nodes and efficient 
signalling. 
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The common-channel signalling system is a packet 
network so that we can use a-classical traffic theory for 
modelling and analysing it. Let us first recall the basics of 
traffic theory, which is used for mathematical traffic 
analysis of optical nodes and for the calculation of 
probability of blocking or delaying the request for unused 
wavelength. The request is blocked in the system 
consisting of a finite number of WDM channels and 
without a waiting queue. Its processing is delayed in the 
system with a finite number of WDM channels and with an 
infinite waiting queue. 


Cami Waiting queue Server Served 
coming requests for 
requestfor ‘ unused 

unused wavelength 
wavelength 


Fig. 2. Model of waiting queue for coming requests for unused 
wavelength. 


The traffic is determined by both the rate at which requests 
arrive and the length of time for which wavelengths are 
held. The traffic is expressed in mathematical terms such 
as 
p=Alu (1) 
where p is the traffic intensity in Erlangs or wavelength 
utilisation, A is the mean request arrival rate ( request per 
unit time), and 1/ is the mean connection holding time 
measured in the same time units as A [5]. 
The request for unused WDM channel arrives randomly. 
The saturation point is established as soon as p becomes 1. 
At that point the request for unused wavelength is blocked 
or delayed. 
Let us suppose that the request arrival process is 
memoryless and with exponential inter-request times. 


preth=e* (2) 


Then the random request is described by stochastic Poisson 
process with rate A [6]. . 


ALY 
p{k)= ay *  K=0,1,2,....220) 8) 
Let us suppose that the connection holding time also obeys 
the exponential distribution. 


p(T >t)=e" (4) 


III. BLOCKING SysTEM 


In this system the finite number of all WDM channels in 
the node exit is NV, and there is no waiting queue for the 
request for wavelength. 


Ns f5°W; (5) 


Fig. 3. System with finite number of WDM channels. 


The new coming requests are blocked as soon as 1 becomes 
equal to p [7]. 


A. Blocking System with VWP 


The number of engaged exit WDM channels 7 is definec 
between 1<n<N. The probability of occupancy of n WDM 
channels is 


P(n)=© PO) — osnsN ©) 


where P(0) is the probability of non-occupancy of WDM 
channels in the blocking system. The sum of al 
probabilities is 1. 


Pn) =I (7) 


The above relation can be used in calculating the 

probability of non-occupancy of WDM channels. 
e” -T(2+N) 

(1+ V)-T(01+N, p) 


where the Euler gamma function T(z) is defined by the 
integral 


P(0)= (8) 


P(2)= ("ear (9), 


For positive integer n, T(n)=(n—1)!. 


The incomplete gamma function T(z,a) is defined by the 
integral 


T(z,a)= [ t*e"dt (10) 


The probability of occupancy of n WDM channels (Eq.6) 
can now be written as 


A cee 
za) ‘ (1+N)-T(0+N, p) 


The blocking of request occurs as soon as the number o! 
engaged WDM channels n are equal to all WDM channel: 
N. The probability of blocking in the system with VWP i: 
Erlang's B distribution. 

p” _¢* -T(2+N) 


N! (1+N)-T(1+N,p) 


(11) 
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(12) 
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Fig. 4. Probability of blocking for VWP with 32 WDM channels. 


B. Blocking System with WP 


In the blocking system with WP probability the blocking of 
request for one WDM channel is 
(eles -T(@) 
2-T(2,p) 
Since there is only one fiber, the eq. (13) is also the 
probability of blocking in the blocking system with WP. 


Pry =p (13) 


emt fe) 

are ERENT 
The two kinds of studied networks can be viewed as two 
extremes of a range of possible network types. In the 
system where W,=1, VWP and WP are equivalent. The 
wavelength converters in this system are needless, because 
this is not the WDM system. When the VW, is a high 
number, the WP solution is becoming useless, because the 
probability of blocking in this case is rising sharply. 
Optical system planing with defined probability of blocking 
could also include a mixture of both VWP and VP. 


(14) 
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Fig. 5. Comparison of blocking system with VWP to WP (number 
of fibers F,=1, number of wavelengths per fiber W.=1,2,4,8). 


IV. DELAY SYSTEM 


As soon as 4 becomes equal to yw, the waiting queue is 
established. The request can be delayed, or can be held 
within the system in case it gets access to a free 
wavelength. Buffered requests are processed when a 
requested wavelength is free. This system is without 
blocking of request, because it has infinite waiting queue. 
Two nodes are connected with F, optical fibers and each of 
them have W, wavelengths. Therefore, there is NV available 
WDM channels, see Eq. (5). "Connection request" signals 
are coming from the source terminals to the first node and 
they are buffering in the waiting queue until one of the 
WDM channels is free. If the request is waiting in the 
queue, it is delayed. 


A. Delay System with VWP 


The delay system is modelled as an infinite waiting queue 
and finite number of WDM channels. 


N=00 


Fig. 6. Model of delay system. 


The probability of occupancy of » WDM channels is 


P(n)=2 P0) 


LT ee.) 
n)=*_(2) P(0) , n=N ~ (6) 


where P(0) is the probability of non-occupancy of WDM 
channels in a delay system. The sum, as already shown, of 
all probabilities equals 1. Applying eq. (15) and (16) it 
follows the probability of non-occupancy of WDM 
channels in the delay system. 


eee EUNo) LIN. DON Io 
oa ao)” 


Delaying is turned up, when all wavelength channels are 
occupied. At that point n>N. Using equation (16), the 
probability of delay can be calculated. 


5 OsnsNy (15) 


P delay),vwe = > Ptr) (18) 
n=N 
N pv 
Fase = NT P(0) (19) 
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Applying the probability of non-occupancy of WDM 
channels in the delay system (Eq. 17) it follows 


-1 


Pp 
(tee -[1- 1.2) 7) Tm OD) 


This expression is known as the Erlang delay formula. 
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Fig. 7. Probability of delay for VWP with 32 WDM channels. 


B. Delay System with WP 


In the delay system with WP probability, the delay of 
request for one WDM channel is 


e” ‘a 
ee pice Use) =p: (Q) 


Since there is only one fiber, there is also the probability of 
delay in the delay system with WP. 
Praclay\we = P (22) 


Again, the two network types studied can be viewed as two 
extremes of a range of possible network types. In this case, 
the WC also reduces blocking of requests for wavelength. 
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Fig. 8. Comparison of delay system with VWP to WP (number of 
fibers F,=1, number of wavelengths per fiber W.=1,2,4,8). 


V. CONCLUSION 


The paper has dealt with the wavelength routing in an all- 
optical WDM _ network. The comparison between 
wavelength path (WP) and virtual wavelength path (VWP) 
is done. It turned out that VWP has a larger routing 
freedom than WP in non-hierarchical transport system 
with blocking and in non-hierarchical transport system 
with waiting queues. Obviously a network with WC is 
more flexible and has smaller blocking probability. 
Therefore, there is always a wavelength conversion gain. 
The results of my study show that the traffic performance 
of the network can be significantly improved in the case of 
VWP with waiting queue. It would be worthwhile to define 
the maximum allowed delay time for different services. 


From the point of theoretical calculations of the traffic in 
different systems, it is suggested that the backbone network 
must be modular, which gives a flexible arrangement to it. 
Therefore, at the beginning, network operator can build an 
optical network based on WP because the WCs are not 
mature and the traffic is small. 

The modularity of the network must be considered from the 
beginning so that the backbone network can be upgraded 
with WCs into VWP when the need of high connectivity 
and high bite rates of traffic will be requested from the 
service provider. The wavelength interchange device is the 
key element for a VWP switching stage. As it has been 
demonstrated, a certain number of these devices (which 
have not yet been fully developed) is enough to build a 
required broadband all-optical network which will be 
extended in the distant future. 
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Self-Adaptive WWW Cache Coherence 
Protocol (SATTL) 
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Abstract — Proxy caches are widely used in wide-area 
network environment (Internet) to improve the performance 
of client-server architectures. However, the use of caches also 
introduces a coherence problem. To maintain coherence, 
World Wide Web (WWW) caches use coherence protocols, 
which can either be proxy or server oriented. 

In this paper, we present a new adaptive WWW cache 
coherence protocol that combines both the proxy-oriented and 
the server-oriented protocols. The adaptive protocol keeps 
track of object’s changes in order to improve coherence and to 
reduce network traffic. Basic mechanisms of the adaptive 
protocol are described, as well as the results of performance 
comparison with other known proxy-oriented and server 
oriented solutions. 

Index Terms — Cache coherence, proxy-oriented 
coherence protocols, server-oriented coherence protocols, 
adaptive cache coherence protocols, wide-area network, 
Internet. 


I. INTRODUCTION 


The rapid growth and availability of wide-area network 
services have significantly contributed to the almost 
exponential growth of the number of users and data 
transfers, which impose network congestion. To enable 
further growth and to achieve the expected quality of 
services, it is necessary to decrease the amount of network 
traffic or to change the network infrastructure. 


Data replication and caching techniques offer an 
efficient solution to this problem, without expensive 
infrastructure changes. They can significantly decrease 
network load and improve response time, but they also 
introduce the coherence problem between copies of the 
same object. To maintain coherence, different protocols are 
introduced. The coherence protocols introduce additional 
network traffic, which should not overcome the gain of 
object replication. Transparent replication is widely used in 
the wide-area network environment and has significant 
importance to achieve good performance. A _ proxy 
implements a network cache, which is usually used in a 
client-server architecture in order to improve performance. 
The proxy cache exploits data temporality and spatial 
locality of object accesses in order to decrease network 
traffic and to improve access time. 
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In proxy-oriented protocols, the proxy cache 
optimistically keeps a copy of the data object for some 
limited amount of time (TTL — time to live), assuming that 
the copy is coherent with the origin server object. If the 
requested object is cached by the proxy and the TTL did not 
expire, the client will receive a copy of the object directly 
from the proxy. If the TTL period expires, the proxy checks 
the server to see if the origin object has changed. All 
changes to the object during the TTL period will not be 
seen by the proxy cache nor by the clients connected to the 
given proxy machine. In contrast, if server-oriented 
protocols are used and the data object is changed, the server 
is responsible for sending invalidation messages or updating 
information to all proxy caches that have a copy of the 
given data object. Some or all of the proxy caches and 
server machines may be locked until a change propagates 
throughout the system and all acknowledgement messages 
have been received. 


The coherence of copies in the WWW can be improved 
by either decreasing the TTL period for the proxy-oriented 
protocols, or by increasing the locking period for server- 
oriented protocols. However, both of these solutions 
increase latency and network traffic. 


This paper describes a self-adaptive protocol (SATTL) 
for maintaining coherence of replicated objects in wide-area 
network. The self-adaptive protocol uses invalidation to 
improve coherence and decrease costs of maintaining 
coherence by adjusting the time of object validation for 
each object independently. The adjustment is a result of s 
previous validation check and server notification of object 
changes. The protocol is independent of object age as 
Opposed to current protocols. To decrease the network 
traffic, invalidation messages are transmitted using 
unreliable communication. Simulation results show the 
performance improvement that can be obtained by using 
our SATTL protocol. 


In Section 2 we describe the architecture and current 
widespread cache coherence protocols. Section 3 presents 
our self-adaptive protocol. We evaluate our protocol and 
compare with other protocols in Section 4. Final comments 
on results of performance evaluation are given in Section 5. 


Il. WWW CACHE COHERENCE PROTOCOLS 


Fixed TTL (FTTL) is a server-oriented cache coherence 
protocol. The server predicts the TTL attribute separately 
for each object. A cached copy is considered valid until its 
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TTL expires; at which point the next request results in a 
validation check with "i-modified-since” (IMS) message. 
However, it is difficult to assign a fixed value for the TTL 
parameter in advance. For small values, the server will be 
overloaded with many IMS messages, although the object is 
not changed. On other hand, for larger values the 
probability of reading a stale copy increases. 


The widely-used Adaptive TTL (ATTL) protocol, also 
known as the Alex protocol [1] is a proxy-oriented protocol. 
The proxy ensures coherence by adjusting the TTL 
parameter based on the age of the object. It takes into 
account the fact that if an object has not been modified for a 
long time, it tends to stay unchanged. The TTL value is 
proportional to the object's current age. 


The FITL and ATTL protocols guarantee that the 
returned object is up-to-date within the specified TTL 
period. If server, proxy or network failure happens, the user 
is notified of the failure. The user can then choose to view 
the cached copy. The most widespread caches combine the 
FTTL and ATTL protocols, for example the Squid object 
cache and the Harvest object cache [2]. 


The invalidation protocol is a server-oriented protocol 
[3]. It relies on the server to send out notifications when an 
object is modified. This implies server responsibility of 
keeping track of the location of object copies and sending 
invalidations to the proxy when the object is changed. If the 
requested object is cached by the proxy and not invalidated, 
the client will receive a copy of the object directly from 
proxy. If the object is invalidated, the proxy sends a request 
to the origin server. Since wide-area networks should be 
considered unreliable, it is necessary to implement 
appropriate validation checks to ensure that network failure 
did not prevent delivery of invalidations from the server. 
Invalidation protocols provide better coherence than TTL 
based protocols, because the network delay of invalidation 
is considerably smaller than duration of TTL. However, this 
protocol incurs additional requirements on server resources 
and increases network traffic. 


Invalidation protocols guarantee that the returned object 
is up-to-date within the time that it takes to propagate the 
invalidation from server to proxy. If a network failure 
prevents the flow of information from the server to the 
client, the user might view stale objects for an extended 


Figure 1: WWW architecture model 


period of time without knowing it. To resolve this problem, 
the AFS protocol [4] periodically checks the server to make 
sure that the proxy can communicate with the server. 


We use a simplified network model to analyze the 
network traffic contributed by the cache coherence 
protocols. In Figure 1, the WWW architecture is presented, 
where clients access server objects through the proxy cache. 
Messages can be delivered reliably, in which case they are 
guaranteed to arrive, or unreliably, meaning the system will 
make an attempt to deliver the message, but that the 
delivery is not guaranteed. 


III. THE ADAPTIVE WWW CACHE COHERENCE 
PROTOCOL 


We developed the self-adaptive protocol (SATTL) for 
maintaining coherence of replicated objects in wide-area 
networks [5]. To improve coherence and to reduce network 
traffic, we combine the invalidation protocol and the TTL- 
based protocol. The server keeps track of object copies, 
which enables sending invalidations to the proxies that 
holds object copies. While the server-oriented invalidation 
mechanism is used to improve cache coherence, the proxy- 
oriented TTL mechanism prevents incoherence due to the 
server and network failure. Since the proxy-oriented TTL 
protocol prevents incoherence, invalidation messages are 
transmitted using unreliable communication to additionally 
reduce network load. All other messages are use reliable 
communication. 


Our protocol dynamically adjusts the TTL for each 
object. The adjustment is based on the validation check 
result (IMS messages) and on received invalidation 
requests. The protocol is independent of object age as 
opposed to currently used ATTL protocols. If a proxy 
receives a copy of an object for the first time, it also 
receives the initial values of TTL. The server assigns the 
initial value of TTL to the object using the same heuristics 
as the FTTL protocols. However, if the initial value of TTL 
is not assigned by the server, the proxy determines the TTL 
value the same way as the ATTL protocol. After the copy 
of object is cached, the proxy independently either 
decreases or increases the value of TTL according to the 
changes of the original object. 


Two events decrease the value of TTL. If the TTL has 
not expired and the proxy receives an invalidation message 
from server, then the proxy will decrease the TTL. If the 
proxy sends an IMS message to the server and the server 
responds that the object has changed, the TTL is decreased 
as well. 


Three events increase the value of TTL: /) the proxy 
sends an IMS message to the server and the server responds 
that the object has not changed, ii) the proxy sends an IMS 
message to the server and the server responds that the 
object has changed after the TTL expires and the proxy did 
not receive an invalidation message, and ii) the proxy 
receives the invalidation message after the TTL expires. 


One of the scenarios which decreases the value of TTL 


is presented in detail in Figure 2a. The initial TTL assigned 
by the server expires at time ¢;. The server changes the 
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object at time f before the TTL value expires (fo < ¢,) and 
sends the invalidation to the proxy. During the period f the 
proxy delivers the copies of the object to the clients. Upon 
receiving the invalidation, the proxy decreases the TTL 
value to reduce the possibility of delivering of stale copies, 
while next client’s requests will be sent to origin server. 


The new TTL value Ty, is calculated based on the 
old value Trt: Tr, = Tro - k, *(t; fo). The decrease is 


proportional to the difference between the old Tri value 
(which expires at time f,) and the time at which the object is 
actually changed (fo). The difference (t; — fo) is multiplied 
by the adjustment factor k,. The adjustment factor 
determines the speed of converging of initial T;7, value to 
the actual value at which the object is changed. Our 
simulation shows that the best results are achieved with the 
value of 0.25 for the adjustment factor k,. 


One of the scenarios which increases the value of TTL 
is presented in Figure 2b. The initial TTL assigned by the 
server expires at time ¢;. During the period #, the proxy 
delivers the copies of the object to the clients. After the 
TTL expires and a client requests the object, a proxy sends 
an IMS request to the server. Since the object is not 
changed, the server responds with the "not modified" 
message. Upon receiving the "not modified" message at the 
time t,, the proxy increases the TTL value to reduce the 
frequency of validation checks. 


As before, the new TTL value T;7, is calculated based 


on the old value Trr,: To =Tm +k, *@-&). The 


increase is proportional to the difference between the time 
at which the proxy receives the "not modified" message (t4) 
and the old Tyr, value (which expires at time #). The 
difference (¢, — f;) is multiplied by the adjustment factor k,. 


Without network failure, our protocol would have the 
same hit rate to the coherent copy as the invalidation 
protocol. Since we use an unreliable communication 
mechanism, while the invalidation protocol uses a reliable 
mechanism, our protocol will incur lower network load. 


If network failure prevents the server to send 
invalidation messages, the hit rate to the coherent copy that 
our protocol achieves is significantly higher than the hit rate 
for the invalidation protocol and other TTL based protocols. 


TV. SIMULATION RESULTS AND PERFORMANCE 
MEASUREMENTS 


We use simulation to compare our self-adaptive 
protocol with other protocols. Since the cost of protocols 
depends on the hit rate to incoherent copies, and since 
reliable traffic and unreliable traffic incur different network 
load, these three costs are combined into the single cost 
function: Cost = Cy) *N+c, *COPRR +c, *COPRy. 
Incoherence N is the average number of hits to the 
incoherent copies per request. Coherence reliable overhead 
per reference COPRp is the average reliable traffic per 
request and coherence unreliable overhead per reference 
COPRy is the average unreliable traffic per request. 


284 


clients proxy server 
A 
= 


[omen te woes 


y time 
b) TTL extension 


message types: —-—» request ——* object data 
Figure 2: Adjusting TTL 


The factors co, c; and c, determine the significance of 
the associated costs in a single cost function. In order to 
simplify the presentation of results, we chose constant 
values for the factors co, c; and c>. For different applications 
and network parameters we can adapt these factors to the 
actual importance of a particular cost function N, COPRp 
and COPR, and technical data. 


The simulation results presented in this section are 
obtained by an artificial workload. We simulate object 
lifetime using a normal distribution with a mean equal to 
100 min and a standard deviation of 10 min. 


Figure 3 presents how the protocol cost depends on the 
initial TLL value determined by the server. The initial TTL 
values are modeled by a normal distribution. The x-axis of 
the graphs shows the initial mean values for TTL. The 
standard deviation for all mean values is 100 min. 


We made the assumption that 10% of unreliable traffic 
is lost and that there is no reliable retransmission. To obtain 
the protocol cost, we assumed that reliable traffic (c,;=3) is 
more expensive than unreliable traffic (c)=1). We have 
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presented the protocol cost for simulated workload using a 
different incoherence factor. Figure 3a shows the cost 
functions that use co equal to 15, while Figure 3b shows the 
cost function that uses co equal to 90. 


As we can see from Figure 3, ATTL (@) and the 
invalidation (#) protocols are independent of initial TTL, 
because they do not use an initially determined TTL by the 
server. Since the invalidation protocol uses reliable 
communication, incoherence N is equal to zero. The ATTL 
protocol incurs lower traffic, but incoherence WN is greater 
than zero. Therefore, if incoherence N is not as important as 
traffic (Figure 3a, co=15), the ATTL protocol achieves 
lower cost than the invalidation protocol. However, if we 
increase the significance for the incoherence (Figure 3), 
co=90), the invalidation protocol achieves lower cost than 
the ATTL protocol. 


As we expect, the cost for the FTTL protocol (4) 
becomes lower for initial TTL values that are close to the 
real object lifetime (Figure 3a, TTL=100min). Moreover, if 
the cost for incoherence increases, then using small initial 
TTL values (Figure 3b, TTL <100min) may reduce the cost. 


For kp=0 the SATLL protocol (QO) does not adjust the 
TTL. In order to achieve lower cost, the initial TTL value 
should be increased (Figure 3a, TTL>100min). As the 
initial TTL value increases, we reduce the reliable traffic 
imposed by IMS requests, while the invalidation messages 
keep incoherence low by using cheaper unreliable traffic. 
However, 10% of lost invalidation messages cause a small 
increase in the cost for TTL values greater than 100 min. 


If we use the adjustment factor kp=0.25 (@), it will 
significantly improve performance of our protocol. 
Moreover, the cost becomes independent of initial TTL. For 
small initial TTL values (TTL<100min), the SATTL 
protocol increases the TTL value to be close to the real 
object lifetime value, which significantly reduces reliable 
traffic caused by the IMS requests and responses. For larger 
initial TTL values (TTL>100min), ‘the SATTL protocol 
decreases the TTL value, which reduces incoherence due to 
the lost invalidation messages. 


V. CONCLUSION 


This paper presents a new adaptive WWW cache 
coherence protocol (SATTL) that combines both TTL based 
mechanism and the invalidation mechanism. Our protocol 
dynamically adjust the TTL value for each object based on 
the validation check result ("ifmodified-since" message) 
and based on invalidation requests. 


We compare our protocol and other currently used 
protocols: server-oriented invalidation protocols and 
protocols based on TTL. In order to improve the 
performance of an application, it is difficult to choose the 
appropriate protocol and to set appropriate parameters for a 
chosen protocol. For example, if incoherence cost is more 
significant than the traffic cost, then the invalidation 
protocol shows better performance. However, if 
incoherence cost is not significant, protocols based on TTL 
should be used. We show that the performance of TTL 
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Figure 3: Performance comparison 


based protocols depends on initially determined TTL value 
by the server. 


The simulation results presented in this paper show that 
the performance of our SATTL protocol does not depend on 
initially assigned TTL values. Moreover, the cost for our 
protocol is lower than the cost for other protocols. This 
enables us to use our protocol regardless of the relative 
importance of incoherence, traffic, and initial TTL values of 
the server. 
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Performance of UDP and TCP Communication on Personal 
Computers 


Andro Milanovic, Sinisa Srbljic, and Vlado Sruk 


Abstract — The number of personal computers (PCs) 
in Internet based information systems has been 
rapidly increasing during the last few years. Designers 
are trying to improve the performance by choosing 
the appropriate hardware and software components 
of PCs that are running Internet based information 
systems. However, little research has been done to find 
how different hardware and software components of 
PCs affect the performance of the two most commonly 
used protocols for transferring data on Internet today, 
TCP and UDP. 


This paper compares data transmission speed for three 
different operating systems: Windows 95, Windows NT 4.0 
Workstation and Linux 2. 1.132. Three different hardware 
configurations based on Intel 486 DX2, Intel Pentium, and 
Intel Celeron A processors are also compared. Finally, we 
show how data packet size affects the communication speed. 


Index Terms — UDP, TCP, performance measure, personal 
computers. 


I. INTRODUCTION 


The Internet has shown amazing growth in recent 
years and its growth is still not slowing down. The 
number of personal computers and their role in the 
Internet have increased as well. Designers of PC-based 
Internet systems must make decisions based on a large 
number of various parameters. Some of the parameters 
designers have to consider are processor, memory, 
network card, LAN technology, operating system, and 
applications. It is hard to find research papers and 
performance results that present how these parameters 
affect the performance of Internet communication of 
personal computers. 


The paper [2] gives performance results for Internet 
communication, but it focuses on RPC-based 
communication and Unix workstations. In this paper, we 
give the results for PCs and the two most commonly used 
protocols for transferring data on Internet today, TCP and 
UDP [3-4]. The first protocol is used for reliable 
communication, while the other protocol, UDP, shows 
better results in high-speed communication where a 
certain level of unreliability is acceptable. 


We present the performance results for UDP and 
TCP communication speed as an extension to the paper 
[1]. The results for UDP and TCP in this paper were 
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obtained by measuring the data transmission speed 
between two PCs. We varied different hardware 
configurations of PCs, operating systems and message 
sizes to find how these three parameters affect data 
transmission speed. 


In Section 2, we describe the measurement 
environment including the software package used in the 
measurement process. The performance measurement 
results for UDP are presented and discussed in Section 3. 
Section 4 describes the results of performance 
measurements for TCP. Final comments on the 
performance measurement results are given in Section 5. 


II. MEASUREMENT ENVIRONMENT 


We developed a small software package for the 
purpose of measuring the performance. The package 
consists of two programs: the Sender and the Receiver. 
There are two versions of the testing package; one 
version is intended for use on the Windows 95 and 
Windows NT 4.0 operating systems, while the other is 
used with Linux. 


The Sender program sends data and the Receiver 
program receives data. The following three parameters of 
the Sender program are configurable: the amount of data, 
the sending speed, and the size of data load. The Receiver 
program counts the received data, because UDP does not 
guarantee delivery of the sent data. 


All tests were performed on a local network using 10 
Mb/s Ethernet, and the test machines were always in the 
same segment. Since we chose the receiving speed as the 
performance measure, we used the fastest system 
available, a Celeron A 333 system, as the sending 
machine. A high-speed sending machine enabled us to 
find saturation speed of the receiving machines. 


Three different hardware configurations of PCs were 
tested and each of them is briefly described in the Table 
1. For the purpose of comparing different hardware 
parameters, all of these configurations were running the 
same operating system, Windows NT 4.0 Workstation. 


Table 1: Hardware configurations of PCs 


Intel 485 DX2__| ‘Intel Pentium | Intel CeleronA 
66 ae ws | 


Processor speed (MHz) 


We also performed tests with three different 
operating systems: Windows 95 OSR2, Windows NT 4.0 
Workstation, and Linux 2.1.132. All operating systems 
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Figure 1: The comparison of UDP transmission speed for different hardware configurations and 500 bytes of packet load 
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Figure 2: The comparison of UDP transmission speed for different hardware configurations and 2 bytes of packet load 


were tested with the same hardware configuration, an 
Intel Pentium 133 system. 


We used two data packet sizes in the measurements. 
One size was 500 bytes, representing an average data 
packet used on Internet in various protocols such as 
HTTP [5]. The other data packet size we used was two 
bytes per message and it presents how the system handles 
small data packets that carry either confirmation 
messages or some small data chunks. 


III. UDP MEASUREMENT RESULTS 


Figures 1-4 present the results of measurements of 
data transmission speed for the UDP protocol. Each 
figure consists of two types of graphs. The x-axis in both 
graph types represents the sending speed and is measured 
as kilobytes per second. The y-axis of Graphs la-4a 
represents the receiving speed and is measured as 
kilobytes per second, as well. The y-axis of Graphs 1b-4b 
represents the percentage of received data. This type of 
graph is useful in finding the threshold at which data 
transmission speed starts to decrease significantly. 


The graphs in Figures la and 15 show that the 
Celeron system is able to use the full capacity of 10 Mb/s 
Ethernet with data packets of 500 bytes. The Pentium 133 
system uses approximately half of the bandwidth, while 
the 486 system shows poor results using about 10% of the 
bandwidth. The comparison indicates that the data 


transmission speed largely depends on the hardware 
configuration, especially on the processor speed. 


The graphs in Figures 2a and 25 show significantly 
lower speed for the data load of two bytes. Low speed is 
caused by large message header overhead. The header for 
UDP packets used in tests consists of 8 bytes plus 20 
bytes for an IP header. Total size of IP and UDP headers 
is 28 bytes, making the headers 14 times larger than the 
data load. Sending and receiving small packets also 
causes frequent operating system calls that additionally 
slow down the communication speed. The Celeron 
system running at 300 MHz has the maximum data 
transmission speed of 6 kB/s. It is about 2.1 times faster 
than the Pentium system at 133 MHz, and about 3.4 times 
faster than the 486 system at 66 MHz. 


Figures 3a and 3b compare the three operating 
systems using 500 bytes of packet load. Linux is about 
two times faster than the Windows systems and is able to 
use the full capacity of 10 Mb/s Ethernet. The Windows 
95 curve shows large deviation in receiving speed that 
occurs at 500 kB/s. For a data transmission speed of 550 
kB/s and higher, Windows 95 shows better results than 
Windows NT. 


The difference between Linux and Windows systems 
is larger for the data load of two bytes. This is presented 
in Figures 4a and 4b. The Linux curve in Figures 4a and 
4b has no threshold, because we could not reach the 
threshold with the sending machine we used. In our 
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Figure 3: The comparison of UDP transmission speed for different operating systems and 500 bytes of packet load 
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Figure 4: The comparison of UDP transmission speed for different operating systems and 2 bytes of packet load 


measurements Linux greatly outperformed the Windows 
systems with at least four times better results, but the 
actual maximum speed of Linux might be even higher. 
Except for a significant deviation of Windows 95 curve at 
2 kB/s, the characteristics for Windows NT 4.0 and 
Windows 95 are similar. 


IV. TCP MEASUREMENT RESULTS 


One graph is sufficient to present the measurement 
results for TCP, as shown in Figures 5-8. Since TCP is a 
reliable protocol, the receiving speed is always equal to 
the sending speed. Therefore, graphs in Figures 5-8 
consist of bars, where each bar represents the maximum 
receiving speed for the given configuration. Since TCP is 
a data stream oriented protocol, it packs the sequence of 
small data portions in one large TCP segment whose size 
depends on internal algorithms. Thus, the data 
transmission speed of small messages is higher for TCP 
than for UDP, which is evident if Figures 6 and 8 are 
compared with Figures 2 and 4. 


In Figure 5, the results for reliable TCP 
communication with messages of 500 bytes are 
compared. The Celeron system is able to achieve the full 
speed of the TCP communication on 10 Mb/s Ethernet. 
The Celeron machine also has 15% better results than the 
Pentium system, and twice the speed of the 486 system. 


Figure 6 shows the maximum speed of the three 
systems with the message size of two bytes. The 
maximum speed obtained in this measurement is 
significantly lower than before. The Celeron system is 
almost twice as fast as the Pentium machine and about 
three times faster than the 486 machine. Since the 
Celeron machine in the previous test reaches the 
maximum capacity of the underlying network, the 
difference between the Celeron and the other systems in 
this test case is larger than before. 


Figure 7 compares different operating systems for the 
message size of 500 bytes. Linux is able to use the whole 
bandwidth, while the Windows systems have 15% lower 
data transmission speed. Comparing the two Windows 
systems, they show similar results, with Windows NT 
having 4% lower speed than Windows 95. 


The results of measurement with two-byte messages 
are presented in Figure 8. The Linux system has 
approximately 11% better results than both Windows 
systems. Windows 95 has higher data transmission speed 
than Windows NT 4.0, but the difference is less than 4%. 
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Figure 5: The comparison of TCP transmission 
speed for different hardware configurations of PCs 
and 500-byte messages 
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Figure 7: The comparison of TCP transmission speed 
for different operating systems and 500-byte messages 


V. CONCLUSION 


In this paper, we describe the set of measurements 
based on personal computers and 10 Mb/s Ethernet. We 
measured the data transmission speed between personal 
computers for two basic Internet data protocols, UDP and 
TCP. Different hardware configurations of PCs, different 
operating systems as well as two message sizes were used 
to find out how each of these parameters affects the 
communication speed. 


Our results show that the hardware configuration 
significantly affects the communication speed of both 
UDP and TCP. Of the three tested hardware 
configurations, only the Celeron system is able to use the 
full capacity of 10 Mb/s Ethernet with 500-byte 
messages. The Pentium system and the 486 system have 
significantly lower speed. All tested systems achieved 
much higher data transmission speed with the larger 
messages. However, the difference between tested 
hardware configurations is larger for smaller data loads. 


The comparison of operating systems for UDP and 
TCP communication shows that Linux outperforms both 
Windows systems. Windows 95 and Windows NT have 
similar results, with Windows 95 being slightly faster. In 
both UDP and TCP communication Linux uses the full 
capacity of 10 Mb/s Ethernet with larger messages. 
Though the Linux system is significantly faster than the 
Windows systems in UDP communication, the difference 
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Figure 6: The comparison of TCP transmission 
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Figure 8: The comparison of TCP transmission speed for 
different operating systems and 2-byte messages 


between these systems becomes smaller with TCP 
communication. 


This paper shows that for small messages it is 
necessary to use very fast personal computer systems and 
real-time capable operating systems, in order to use the 
whole capacity of high-speed local networks. With larger 
data loads it becomes possible to use more of the network 
bandwidth. The paper also shows that the requirements 
for personal computers become less demanding if larger 
messages are used in communication between PCs. 


VI. ACKNOWLEDGMENT 


The authors wish to thank A. Grbic, Zeljka Mihajlovic, 
and Leonardo Jelenkovic for providing valuable 
comments on the research presented in this paper 


VII. REFERENCES 


(1) A. Milanovie, S. Srbljic and J. Radej: " Performance of Distributed Systems Based 
on Ethernet and Personal Computers ", Proceedings of ISIE '99, Bled, Slovenia, 
1999. 


[2] Chandramonah A. Thekkath and Henry M. Levy: "Limits to Low-Latency 
Communication on High-Speed Networks", ACM Transactions on Computer Systems, 
Vol. 11, No. 2, May 1993, pp. 179-203. 


(3) Douglas E. Comer: “Internetworking with TCP/IP”, Vol. 1, Prentice Hall Inc., 
1991 

[4] T. Socolofsky, C. Kale: “A TCP/IP Tutorial”, RFC 1180, Spider Systems Limited, 
January 1991 

[5] R. Fielding, et. al: “Hypertext Transfer Protocol - HTTP/1.1”, RFC 2068, UC 


Irvine, Jarmary 1997 


10 Mediterranean Electrotechnical Conference, MEleCon 2000, Vol. I 289 


A Survey on the Java-based Approaches for Web Database Connectivity’ 
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Abstract 


The undeniable popularity of the web makes the efficient 
accessing of distributed databases from web clients an 
important topic. Various methods for web database 
integration have been proposed but recently there is an 
increasing interest on those based on Java-based ones. This is 
due to the inherent advantages of Java, which supports 
platform independence and secure program execution, and 
produces a small size of compiled code. In this experimental 
paper, we evaluate all currently available Java-based 
approaches. These include Java applets, Java Sockets, 
Serviets, Remote Method Invocation, CORBA, and Mobile 
Agents. To this end, we implemented a Web client accessing a 
remote database using each of these approaches and compared 
their behavior along the following important parameters: (1) 
performance expressed in terms of response time under 
different loads, (2) transparency of communication expressed in 
terms of complexity of networking API, and (3) extensibility 
expressed in terms of ease of adding new components. 


Keywords: Distributed Databases, WWW, Java Mobile Agents, 
Distributed Objects, CORBA 


I. INTRODUCTION 


Providing efficient access to distributed databases from 
Web clients using a Web browser [2] is crucial for the 
emerging database applications such as E-Commerce. 
Several methods for Web database connectivity and 
integration have been proposed such as CGI scripts, active 
servers pages, server side include, databases speaking http, 
external viewers or plug-ins, proxy-based, and HyperWave 
[4]. However, there is an increasing interest in those that are 
Java-based due to the inherent advantages of Java [1], 
namely, platform independence support, secure program 
execution, and production of a small size of compiled code. 

Several Java-based methods are currently available for 
Web database integration but in the best of our knowledge, 
there is no quantitative comparison of them. This 
experimental paper contributes such a comparison. 
Specifically, it evaluates six approaches, namely, Java 
JDBC applet, Java Sockets [7], Serviet [7], Remote Method 
Invocation (RMI) [7], CORBA [9], and Java mobile agents 
(JMA) [3]. Each approach differs in the way the client 
establishes connection with remote database servers. 

For our evaluation, we used each of these approaches 
to implement a Web client accessing a remote database and 
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compared their behavior along the following important 
parameters: (1) performance expressed in terms of response 
time under different loads, (2) ‘transparency of 
communication expressed in terms of complexity of 
networking API, and (3) extensibility expressed in terms of 
ease of adding new components. Further, we characterized 
these approaches in terms of the total development effort 
based on lines of code at both the client and the server side 
in conjunction with the two latter parameters, namely, 
transparency and extensibility. 

In the next section, we briefly describe our testbed. In 
Sections III to IV, we elaborate on the characteristics of 
each approach when comparing them along the dimensions 
of communication transparency and extensibility. In section 
VI, we present our performance evaluation results. 


II. EXPERIMENTAL TESTBED 


Two design principles were adopted in the selection of the 
various components during the development of the testbed. 
First, our Web clients should be lean with the purpose of 
allowing fast downloads and therefore increasing support 
for wireless and mobile clients. Second, no a-priori 
configuration of the Web client should be necessary to run 
the experiments in order to maintain portability, and 
therefore support arbitrary clients. 

For every approach, our Web client program was a 
Java applet, which was installed on a Web server machine 
along with an html page. Every experiment was initiated by 
first pointing to the html page from a remote client 
computer (Figure 1). After the Java applet was initialized at 
the client computer, queries were issued through the 
applet's GUI and executed at the remote database server. 
Our remote database server, a 3-table relational Microsoft 
Access database, was installed on the same machine with 
the Web server. The communication between the server and 
the client computer was a wireless LAN at 1.2Mbps. 

In all cases, the client establishes Web database 
connectivity through a middleware program typically 
running on the Web server machine. For the Java JDBC 
applet, we used a type 3 JDBC driver in accordance to our 
design criteria. In this case, the middleware corresponds to 
the middle-tier gateway of the type 3 JDBC driver [8]. In 
the case of JMA, the middleware is a local stationary agent 
that provides the information necessary for a mobile agent 
to load the appropriate JDBC driver and connect to the 
database server. In our experiments, we used DBMS-aglets 
[6]. In all other cases, we developed the middleware 
program, which plays the role of an application server that 
uses a JDBC-ODBC bridge driver to connect to the 
database server. For more details, see [5]. 
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Figure 1: Basic configuration 


III. TRANSPARENCY OF COMMUNICATION 


The dimension of the transparency of communication deals 
with the level of abstraction of communication between the 
client and the server side, in other words, between the client 
and the middleware program. The approaches can be 
broadly classified as (1) non-RPC ones, that do not support 
any clear remote method invocation mechanism, and (2) 
RPC ones with clear remote method invocation semantics. 
The Java Sockets and Servlets are non-RPC approaches in 
which information between the client and the middleware 
program is exchanged using streams of data. Java JDDC 
applets, RMI, CORBA and JMA are all RPC approaches. 

Table 1 compares the transparency of communications 
of the discussed approaches based on the complexity of the 
networking API employed by each approach. Clearly, the 
RPC approaches involve less complex networking APIs and 
hence more transparent client/server communication. 

The CORBA approach offers the highest level of 
communication transparency since it requires knowledge of 
neither URLs nor port numbers to establish database 
connectivity. It only requires the reference name with 
which the application server was registered at the server 
site. One level lower (high) is the RMI approach, which 
requires the URL of the database server along with the 
reference name of the application server. Similar to the 
RMI approach, the Java JDBC Applet approach requires the 
URL of the database server, and a data source name, which 
identifies the database itself. In this same level is the JIMA 
approach. Mobile agents identify remote host machines 
with their URL, and interact with other agents using their 
unique identifiers. One level below (/ow) is the Servlet 
approach requiring a URL, the servlet name, and the type of 
operation to be executed by the particular servlet. Finally, 
as expected, the approach with the /Jowest communication 
transparency is the socket approach, which requires 
knowledge of the IP and of the port number of the 
application server. 


IV. EXTENSIBILITY 


We define extensibility to be: (a) the ability of adding new 
components to an approach (e.g., a new application server 
object attached to a local or a remote database) and binding 
them with the existing ones at the server site; and (b) the 
level of modifications needed at the client part that will 
enable the client to utilize newly added components. We 
classified the various approach in terms of extensibility as 
highest, high, average and low (Table 1). 

The approaches with the highest degree of extensibility 
are the CORBA and JMA. In the one based on CORBA, the 
application server and the client applet can bind to a newly 
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added component by only using its reference name. As 
opposed to other approaches, new components need not be 
necessarily located at the Web server machine in order for 
the client to bind to them. 

The JMA approach is inherently very extensible since 
mobile agents were designed to autonomously collect 
information and exploit any newly added servers in order to 
complete their execution plan. Moreover, the Web client 
need not be aware of the existence of new servers. 

The high extensibility of the JDBC applet approach is 
due to the type 3 JDBC driver used. Of all the JDBC 
drivers, type 3 drivers are the most extensible because of 
their middle-tier gateway that maps client applet’s database 
requests to any local or remote database calls. The Web 
client only needs to name the newly added databases. 

The servlet approach also offers high extensibility. 
Servlets execute in the context of the Web server and can 
call (explicitly) other servlets within the same context. This 
means new servlets can be added without any Web client 
modification. The client applet can also call explicitly a 
new servlet using as a reference the URL, the new servlet’s 
name and the type of operation that must be executed by it. 

Because of their similarities, one might have expected 
that RMI and CORBA approaches would exhibit the same 
degree of extensibility. However, compared to the CORBA 
approach, the RMI approach is much less extensible for 
three reasons. First, new components must be written only 
in Java. Second, new components are identified, besides of 
their reference name, with an additional URL. Lastly, the 
client applet cannot bind to new components that reside on 
URLs other than the Web server. Clearly, RMI also offers 
lower extensibility than the Java JDBC applet approach. 

Finally, the approach with the Jowest degree of 
extensibility is the socket one. For any new component, a 
new socket must be created, bound and managed either at 
the side of the application server or at the Web client. 


V. EFFORT OF DEVELOPMENT 


The effort of development basically combines the 
dimensions of transparency of communications and 
extensibility, and quantifies them in terms of lines of code. 
In Table 1, the lines of code for each approach are 
normalized with a constant C. 

The approaches with the lowest effort of programming 
are the Java JDBC applet and the CORBA. The applet 
approach combines the fewer relative lines of code, high 
level of network transparency and an average extensibility, 
while the CORBA approach offers the highest transparency 
and extensibility with relative small code size. 

The high extensibility and transparency of the JMA 
approach comes with a premium in terms of lines of code. 
Mobile agents involved significant programming. The RMI 
approach is the opposite of the JMA one. It requires a 
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relatively low number of lines of code and offers average 
extensibility. 

The Servlet and Socket approaches involve the most 
effort of development, given their large number of lines of 
code and their low transparency and extensibility. 


VI. PERFORMANCE EVALUATION 


For our performance evaluation, we measured the average 
response time for our Web client (a) to query the remote 
database for the first time, (b) to query the remote database 
for a number of subsequent times. Querying the remote 
database for the first time differs from subsequent queries 
because the first query involves the additional overhead of 
establishing the communication link between the client and 
the remote database. 

The other significant issue that we considered in our 
experiments is the size of the query result. Query result size 
directly affects the response time in two ways. First, in the 
amount of time spent for the query to execute, and second, 
in the transport time for the results to reach the client. We 
adjusted the size of the query result by changing the 
complexity of the SQL statement issued through the client 
applet. For our experiments we measured average response 
times for a wide range of query result sizes, beginning from 
128 bytes (8 tuples) up to 64 kilobytes (1000 tuples). For 
each approach, a sufficient number of runs were performed 
in order to obtain statistically significant results [11]. Below 
we first focus on the experiments for small query results 
(128 bytes) and then discuss how the response time of the 
different approaches is affected by the query size. 

Graph 1 shows the average response time for the initial 
and subsequent queries in each approach. For the initial 
query, the non-RPC approaches have by far the lowest 
response time. This can be explained by the fact that their 
initialization phase does not engage any special package 
loading or handling by the client. Compared to the Socket 
approach, the Servlet approach performs slightly worse 
because (a) the communication between the client and the 
servlet is marshaled by the Web server, and (b) by 
executing as a Web server thread. the servlet receives less 
CPU time than the socket application server. Thus, servlets 
respond slower to requests and require more time to 
assemble and return the query results. 

From the RPC approaches, the JMA approach offers 
the best performance for a single (initial) query. Significant 
part of its cost (around 2 seconds) is due to the process of 
dispatching the DBMS-aglet from the client applet to the 
aglet router on the Web server and from there to the 
database server. In the case of the CORBA approach, the 
first query is slightly more expensive than the one in the 
JMA approach because of the overhead of initializing the 
necessary ORB classes and the binding to the application 
server. This overhead is quite significant (around 3.20 
seconds) which can be clearly seen by comparing the 
response time of the initial and subsequent queries. 
Following the CORBA approach is the Java JDBC 
approach in which the response time of the initial query is 
increased by a considerable amount of time by the 
downloading of the JDBC driver from the Web server. 


= 
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Graph 1: Performance of all approaches for initial and subsequent query 
(128 bytes result size) 


To our surprise, the RMI approach performs by far the 
worst for the initial query. We expected the RMI approach 
to exhibit better performance because, as opposed to the 
other RPC approaches, it does not involve the loading of 
any specific package during initialization time. The only 
way to explain this behavior is to attribute the increased 
response time to the interpreted method of RMI calls when 
binding the client applet to the application server. 

For subsequent queries, the performance of the 
CORBA and RMI approaches dramatically improves, and 
becomes close to the best performance exhibited by the 
Socket approach. The reason is that the client applet is 
already bound to the remote application server and only a 
remote procedure call on the application server is required 
to query the database. For a similar reason, the Java JDBC 
applet approach also exhibits a significant performance 
improvement for subsequent queries - the JDBC driver is 
already downloaded and initialized at the client applet. 
Having the DBMS-aglet already connected to the remote 
database and ready to process a new query on behalf of the 
client applet, the JMA approach also improves its response 
time for subsequent queries. However, this response time is 
the worst from all the other approaches. We attribute this to 
two reasons. First, the two required messages to implement 
subsequent queries have to be routed through the aglet 
router, and second, a mobile agent is not a stand-alone 
process and it does not receive full CPU time. 

On the other hand, the Java Servlet approach improves 
only slightly its performance because the steps for 
executing a subsequent query do not differ from the ones 
for the initial query. The minor improvement is due to the 
fact that any subsequent URL connections from the client 
applet to the Web server require less time since the address 
of the Web has already been resolved in the initial query. 

In order to better illustrate the scalability of each 
approach, we plotted in Graph 2 the average time required 
by each approach to query the database for a number of 
consecutive requests using the formula: For n consecutive 
queries, the average time required is the sum of (a) the 
average response time for one initial query, and (b) n-1 
times the average response time for a subsequent query. 

As shown in Graph 2, the socket approach is the most 
efficient for any number of consecutive queries. Despite its 
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Graph 2: Average performance for up to 30 consecutive queries (128 bytes of 


result size) 


good performance for initial queries, the Servlet approach 
does not scale well since the response time for subsequent 
queries almost matches the response time for initial queries. 
Likewise, the JMA approach scales very badly given that its 
response time for subsequent queries is the worst of all the 
approaches. The CORBA, Java JDBC applet, and RMI 
approaches appear to scale well, however, the RMI 
approach appears less attractive due to its worst 
performance of all the approaches for initial queries. 

Graph 3 illustrates the sensitivity of each approach to 
the size of query results. Due to space limitations, we show 
here only the results for subsequent queries. The results for 
initial queries are similar. 

The first striking observation is that the response time 
of the Java JDBC applet and JMA approaches increases 
exponentially with query result sizes larger than 20KB. 
The Java JDBC applet approach performs by far the worst 
for increased result size. This can be explained by the fact 
that in JDBC rows from a query result are retrieved one at a 
time. Specifically, to retrieve one row from the query result, 
the client must call a method on a Java ResultSet object, 
which is mapped on the remote database server through the 
Gateway. Consequently, for a large size of query result, a 
large number of those remote calls have to take place. In 
that case, large query results not only increase dramatically 
the response time but they also increase the Internet traffic. 

The bad scaling of the JMA approach can be explained 
in the same way as the bad performance of the Servlet 
approach. Both mobile agents and servlets do not execute as 
stand-alone processes, and therefore, they do not receive 
full CPU time and heavily depend on the supporting 
execution environment. The other RPC approaches exhibit 
acceptable performances (close to linear for sizes above 
20KB) with the CORBA approach being slightly better. As 
indicated above, the implementation of RPC calls in 
CORBA is much faster compared to RMI’s one. 


VII. CONCLUSIONS 


In this experimental paper, we have implemented, 
evaluated, and compared all currently available Java-based 
approaches for Web database connectivity. Our comparison 
was based on the performance of query processing, the 
transparency of communication and extensibility. 
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The results of our comparison showed that the CORBA 
approach is the most transparent to communication, 
extensible and easy to develop, while its performance is 
comparable to the best performing approach that employs 
sockets. Hence, it offers the best promise for the 
development of large Web applications. 

In our study, we confirmed the desirable properties of 
the emerging mobile agents technology, that is, of high 
extensibility and transparency at a relatively low 
development effort. But, at the same time, our study 
provided an insight to potential scalability problems with 
the currently available mobile agent implementations. The 
JMA approach cannot support interactions that require 
movement or exchange of large amounts of data such as 
large number of consecutive queries with increased size of 
query result. Hence, it is necessary to develop more 
efficient mobile agent infrastructures, if the full potential of 
mobile agents is to be explored. As part of our future work, 
we investigate the possibility of merging mobile agents and 
the CORBA technology in order to facilitate a scalable and 
efficient Web database connectivity. 
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A Web-based telecommunication needs model- 
ing and implementation according to the EOTIP 
methodology 


A. Papantoniou, C. Anagnostopoulos, V. Loumos and E. Kayafas 


Abstract-- A general procurement methodology, carried 
out by a Web based tool called “T-Pro” is proposed in this 
paper. This tool can be used by people who have to select 
the appropriate telecommunication services for their com- 
pany. The full range of these services is specified by the 
European Telecommunications systems Interconnection 
Profile (EOTIP). These services are based on published 
European and global standards and provide the cost effec- 
tive support necessary for the operation of almost any 
modern business.The physical architecture of the “T-Pro” 
tool consists of a Web Server, a Repository Server and the 
appropriate User Interfaces. The Web Server processes the 
client requests and, by using decision logic, accesses the 
repository and dispatches to the clients the requested data. 
The Repository Server, being a member of the object- 
oriented category, is used for storing and retrieving the 
information. The User Interfaces are implemented by using 
various technologies like ASP (Active Server Pages) and 
DTC (Design Time Controls). 


Index Terms: Telecommunication Procurement, European Open 
Telecommunication Interconnection Profile (EOTIP), End-User 
Types, End-User Services, Management Services. 


I. INTRODUCTION 


This paper refers to a new approach for telecommunication 
products procurement through the Internet, by modeling the 
procurer’s needs on the basis of EOTIP [1] methodology. It 
describes the Web-based application developed in the 
framework of the European project “GEAR”, which aims to 
increase the usability of the EOTIP, by delivering a flexible 
and user-friendly software tool. 


II. BACKGROUND TO EOTIP 


A. Purpose of EOTIP 


The European Telecommunications systems Interconnec- 
tion Profile (EOTIP) [1] fulfils two general objectives re- 
lated to the procurement of Telecommunications equipment 
and services. These are: 
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1. To specify a full range of up-to-date telecommunica- 
tions services which, as far as possible, are based on 
published European and global standards and which 
provide the cost-effective support necessary for the op- 
eration of almost any modern business. 


2. To simplify the selection of the telecommunications 
services which are appropriate to the business needs of 
any particular organisation 


The services specified within EOTIP take into account the 
current capabilities of existing equipment and networks 
while also considering the operational needs of organiza- 
tions in today’s business environment. 

EOTIP can be applied by any organization to analyze and 
describe their telecommunications needs and procure the 
necessary telephone equipment and services. The method- 
ology describes the telecommunication features in a neutral 
way referring relevant standards where they exist. 

The EOTIP analysis process involves the classification of 
users and the selection of the appropriate services for each 
type. The selection procedure consists of identifying the 
needed services from tables listing procurement clauses of 
Standardised and Non-standardised Services, Network and 
Management Services, along with an identification of the 
importance of the service to be used, as well as an identifi- 
cation of the appropriate standard related to the service. The 
resulting lists are enriched by the identification of the ac- 
cess methods and the required interfaces. 


B. Classification according EOTIP 


The steps followed in the EOTIP methodology consist of 
[2]: 

1. Classification of all staff members as End-User Types 

(part 0). 

This part establishes the background to EOTIP and the gen- 
eral context in which it is to be used. It also describes a 
simple organisational model which can be used to analyse 
any business or public service enterprise and upon which 
the subsequent selection of telecommunications services is 
based. 
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2. Selection of End User Services for the End-User Types 
(Part 1) 


This part of EOTIP [2] describes a full range of telecom- 
munications services, which can be used to improve the 
productivity of individual end-users. It also indicates which 
services should be considered for each of the end user types 
identified in the EOTIP Model. In addition, it identifies the 
standardised communications interfaces (for example, 
ISDN and GSM) which could be considered for each user 
type. 

3. Selection of Network Services (Part 2) 


Part 2 of EOTIP [2] identifies and describes an extensive 
list of features, which can be used to support the users as 
functional groups within the overall organisation, rather 
than as individuals. Its main emphasis is on services for the 
effective handling of incoming calls. 


4. Selection of Management Services (Part 3) 


Part 3 of EOTIP [2] identifies and describes a set of net- 
work management services, which enable an organisation’s 
telecommunications services to be operated efficiently and 
cost-effectively. Criteria are provided to simplify the se- 
lection of the appropriate services for a particular organisa- 
tional structure. 


5. Compilation of the results (Part 4) 


When the organization has been modeled and the necessary 
telecommunications services have been selected, the results 
should be used as the basis for a procurement specification. 
Further specification of information regarding parameters, 
limits and options can be added and EOTIP will help to 
identify where such information is needed. 


Ill. T-PRO 


The tool produced, under the name of T-Pro, is sophisti- 
cated, IT-based and eventually eases the process and gives a 
full range overview of the services to be selected, along 
with the related standards. All necessary data is stored in a 
dynamic information repository, providing the user with 
flexibility in searching and selecting the appropriate data, 
according to the EOTIP methodology. 

The dynamic database structure allows the use of the meth- 
odology, either by restructuring and re-linking the data ac- 
cording to selected criteria, or by adding new modules in 
the procedure, without causing an overhead to the applica- 
tion user. 

The T-Pro tool aims to increase the usability of the EOTIP 
methodology, by developing and delivering a software tool 
which will help procurers in assessing and applying the 
EOTIP methodology in a more efficient and user-friendly 
way. The methodology's process of analyzing telecommu- 
nications needs and purchasing telephony equipment and 
services would gain respectively in terms of usability and 
quality. 


A. Objectives of T-Pro 


The objectives of T-Pro are to: 


1. Perform a software implementation of the EOTIP 
methodology where users can specify their needs from 
the business and work practice point of view. 


2. Help users apply EOTIP through a stepwise procedure 
using online electronic forms, context sensitive help 
and dynamic questionnaires according to the user’s 
general questionnaires and NxN clause-services matri-. 
ces. 


3. Offer a tool set based on a powerful object repository 
providing intelligent EOTIP knowledge management. 


4. Use an open architecture for the tool based on a re- 
pository that enables future updates. 


B. Overall Architecture 


The application operates in a three-tier client/server envi- 
ronment. This approach establishes a communication of a 
number (one to several) of clients that are running the front- 
end application to the back-end database engine, typically 
running on a separate computer [3]. The clients use a 
lightweight browser approach, based on the Active Server 
Page technology (described later in this paper), to commu- 
nicate and request data, while the decision logic and the 
actual EOTIP data will reside on the server. The server, 
containing the actual data-store and the meta-logic, will 
inter-operate with the www service, implemented through 
the use of Active Server Pages that will provide the inter- 
connection of the clients to the actual repository. Advan- 
tages of this approach include: 
1. the best possible functionality exploitation of the data- 
base server logic 
2. the reduced size and load of the front-end application 
running on the client 
3. independence gain, due to the fact that server logic may 
be updated to reflect new business requirements without 
changes in the client front-end software. 
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Figure 1. The architecture of T-Pro. 


The Web Server processes the client requests and by using 
the decision logic, accesses the repository and dispatches to 
the clients the requested data. 

The Repository Server is used for storing & retrieving the 
information, using the Meta model. It also separates differ- 
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ent projects into separate workspaces and handles security, 
administration and configuration management issues. 

The User Interface allows the user to handle the EOTIP 
methodology by using a two-pane, explorer-like, interface 
that expands along with the user input. The questionnaires, 
along with the current structure appear in more detail in the 
right pane of the tool. Relations with any object in the re- 
pository (e.g. services) of the currently selected item and its 
descending route, along with information for items shown 
in the path, are displayed as well. The navigational model 
is part of this interactive User Interface, visualising and 
guiding the user through the procurement planning process, 
interacting accordingly with the decision logic. 
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Figure 2. Interface for the procurer (C-User) 


C. T-Pro’s Repository 


The object-oriented repository used for this implementa- 
tion named Enabler™ is provided by Softlab GmbH. 
This product, is an open object repository for Microsoft 
Windows or Unix environment [4]. It provides the foun- 
dation for integrated storage and management of data 
along with flexible and high-performance access to this 
data by users and applications. 


Figure 3. The repository implemented to T-Pro. 


D. Physical Architecture of Enabler-Suitability for T-Pro 

Enabler runs in a distributed client/server environment and 
its server part provides the necessary repository and inte- 
gration services for client access. The Enabler client soft- 
ware runs under either Windows NT or Windows 95 and 


includes the Enabler tools (e.g. the administration tool and 
the modeling tool) as well as integration services for other 
client applications with access to the repository [5]. 
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Figure 4. Physical architecture of Enabler 


The services provided by Enabler allow the modeling of a 

direct representation of the telecommunication needs ac- 

cording to the EOTIP methodology. The Enabler data 

model, for the T-Pro tool, describes the elements of the 

telecommunication procurement as objects whose proper- 

ties are represented by attributes. Existing connections be- 

tween the objects are modeled as relationships, which also 

have attributes describing them. Objects and relationships 

are grouped into abstract object classes and relations. Mod- 

eling the EOTIP methodology in Enabler provides the fol- 

lowing benefits: 

1. complex EOTIP methodology is modeled with a com- 
mon structure, allowing updates. 

2. changes and extensions to the data model, can be made 
while the repository is running 

3. refinements and enhancements can easily be performed 
by the Suppliers side. 

4. users can retrieve information about the EOTIP model 
at any time. 


E. Logical Data Specification (Meta-Model) 


The T-Pro’s logical architecture consists of three parts as 
depicted in Figure 5: 


1. The data model of the C-Users (Clients), containing the 
name and address of a C-user along with his require- 
ments and the description of this company. 

The EOTIP-metamodel as defined in [2]. 


3. The data model for the S-User (Supplier), containing 
the address, the contacts and the list of their products. 
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Figure 5. Logical Architecture of matching 


IV. SOFTWARE BACKGROUND 


A. Introduction 


The implementation of the User Interface, as described ear- 
lier in this paper, is done through the use of the Active 
Server Pages (ASP) technology [6]. This technology, which 
was first offered by Microsoft’s Internet Information Server 
version 3.0, combines support for server-side scripting with 
support for component use on the server and a model to 
provide a Web programming platform. Active Server Pages 
is compatible with any browser that supports cookies and 
HTML 3.2. 


B. Benefits 


This technology provides a great many solutions [7],[8], the 

most important of them being: 

1. solution to concerns about exposure of proprietary busi- 
ness logic by moving such logic to the server where it 
can be secured 

2. solution to the challenge of HTTP statelessness by pro- 
viding intrinsic objects that allow ASP applications to 
keep track of both session and application state 

3. solution to the problems inherent in other scripting tech- 
nologies (CGI) by providing simple use of components 
in addition to server-side scripting 

4. ASP can be used as glue for components [7], setting 
properties and calling methods. These components can 
be built to return results as HTML, so there is no need to 
write the code to the browser 

5. ASP has a simple, programmable object model, thus pro- 
viding services such as session management, user input, 
browser output and component creation [8]. Therefore, 
much of the complexity of server programming is ab- 
stracted, allowing the developers to concentrate on de- 
veloping their application, rather than the services re- 
quired to support it. 


C. Components 


The T-Pro tool uses controls, which are called Design Time 
Controls (DTC). These controls allow Web programmers to 
insert code into their ASP pages. In this way we were al- 


lowed to design our pages with a WYSIWIG editor, insert 
(drop) appropriate Enabler components, define the proper- 
ties and preview the result. The application environment 
used for the construction of the user interface was Micro- 
soft’s Visual Interdev 6.0 


V. STANDARDS USED 


The tool is created on the basis of EOTIP, which aims to 
establish a standard methodology for the telecommuni- 
cation procurement especially for large enterprises and 
organizations. At the same time EOTIP serves as a 
guideline for potential suppliers as to which standards 
must be met in the future to fulfill communications re- 
quirements. 

Consequently, T-Pro states the operational needs of or- 
ganizations in today’s business environment with refer- 
ence to the relevant standards when they exist. 


VI. IMPACT TO THE MARKET 


The main difficulty in attempting to determine categories of 
users and services within the framework of EOTIP is the 
diversity in the character, the structure and, thus, the re- 
quirements of each organization. The manipulation of dif- 
ferent situations and the accumulation of experience may 
continuously add further requirements. As the current clas- 
sification of end-user types does not cover all users, either 
as individual or as groups, in a clear and distinctive manner, 
an emphasis will be given to an open architecture of 
EOTIP, so that it will be able to allow for refinement (sub- 
divisions of existing end-user types), extensions (enrich- 
ment by additional types) as well as restructuring. The ob- 
ject-oriented rationale of T-Pro will meet these require- 
ments. 


VII. CONCLUSIONS 


T-Pro is a software tool that eliminates the current lack of 
process interactivity providing automatic answers after a 
given input. It would not only speed up the process of ap- 
plying EOTIP, but would also make the most proper identi- 
fication of End User -Types and Services, leaving less room 
for incorrect selections. 
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An Internet Application for Home Automation 


Renato J. C. Nunes, Member, IEEE, and José C. M. Delgado, Member, IEEE 


Abstract--This paper describes an Internet application that 
allows local and remote monitoring and control of a home. The 
application adopts an object-oriented philosophy, in which 
every home automation device is represented by an object. This 
leads to an integrated view of the home, with a common look 
and feel across all the devices, while supporting various 
methods of actuation and sensing. 

The application runs in a PC with a web server. The user 
interface is done through a standard browser using common 
HTML pages or Java applets. 

The interaction with the home control system(s) is done 
through specific hardware or PC standard interfaces such as 
the serial port or USB, or by LAN. 

The application offers a powerful way of controlling and 
programming the behavior of the home, using scripts. For the 
common user, a simpler version of a script - a scenario — may 
be used. A scenario consists of a set of devices in a particular 
state and can be activated directly by the user, by a time 
stimulus or by any event in the home. 


Index terms—home automation, internet application 


I. INTRODUCTION 


Home automation is a very promising area. Its main benefits 
range from increased comfort and greater safety and 
security, to a more rational use of energy and other 
resources, allowing for significant savings. It also offers 
powerful means for helping and supporting the special 
needs of people with disabilities and, in particular, the 
elderly. This application domain is very important and will 
steadily increase in the future. 


Although home automation offers many advantages, it has 
not yet achieved the broad acceptance one would expect. 
The reasons for that are many and not all of them are well 
identified or understood. One important aspect is related to 
the small ratio between the features offered and the cost of 
the current solutions. This is due, in part, to the lack ofa 
powerful standard technology with broad market 
acceptance. This would allow the appearance of a multitude 
of different products, compatible with each other, even if 
produced by different companies. Besides this advantage, 
one would also expect a price reduction due to increased 
market competition. 


The authors are Assistant Professors at IST - Technical University of 
Lisbon and researchers at INESC. E-mails: Renato.Nunes@inesc.pt, 
Jose. Delgado@inesc. pt 


Another very important aspect regards the interface with the 
users - the home dwellers. It is not enough to have a 
powerful and standard technology. It is also fundamental 
that the user can configure the system accordingly to his 
house and his specific needs and can control the system 
functionality easily, on a daily basis. 


The work presented in this paper tries to contribute to the 
home automation field by addressing several problems: 

1. Configuration of a home automation system to a 
specific house; 

Transparent interaction with one or more home control 
systems; 

3. Loca! and remote access to the home; 

4. Simplicity of the user interaction. 


N 


The paper starts by describing the general architecture of 
the system and identifying its main components and their 
roles. Then the conceptual model of the system is presented, 
using UML, and the various classes that allow to configure 
and to program the system's behavior are described. 


II]. GENERAL DESCRIPTION OF THE SYSTEM 


In figure 1 we present the general structure of the proposed 
system. It is centered on a personal computer that is 
connected to the Internet and eventually to a LAN. This 
computer — the Home Server — connects to one or more 
home control systems and runs a home automation 
management application that supervises them. Each control 
system may be responsible for a specific functional area 
such as lighting, air-conditioning, safety, security, video 
surveillance, etc. The Home Server will encapsulate the 
particularities of each system and offer a common and 
global interaction means with all of them. 


All of these systems must be able to interact and be 
controlled by the home server. In this way, multiple control 
systems can coexist in a single home, even if they use 
different technologies, such as X10, CEBus, LonWorks, 
BatiBUS, EIB, etc. At the physical level, these home 
control systems connect to the home server by a simple 
interface such as the serial port or USB, by an add-on board 
or by LAN. At the protocol level, it is up to the management 
application to interface with the various systems. 
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Figure 1 - General Architecture of the System 


The Home Server may be accessed using any computing 
device equipped with a standard web browser. This is 
clearly an advantage of this architecture based on the 
Internet, as it allows the system to be access transparently 
from multiple places, locally (through a LAN) or remotely. 


Considering the recent trends, in the near future almost any 
computing device will have Internet access and could be 
used to monitor or control the home. Of particular interest 
are the PDAs (Personal Digital Assistants) and the new 
generation of mobile phones that are Internet enabled. Who 
wouldn't be interested in controlling and knowing what is 
happening at his home from virtually anywhere in the 
world? 


II]. INTERNAL STRUCTURE OF THE HOME SERVER 


In this section we describe briefly the internal structure of 
the home server — see figure 2. The key element is a process 
called Home Automation Manager that supervises the whole 
system and is responsible for the interaction with the users. 
The Home Automation Manager uses a database to keep all 
the information regarding the system configuration, the 
current state of each home device and the actions to perform 
on the system accordingly to the date and time, user actions 
or events occurred in the home (e.g., a light switch pressed 
by the user). 


The Home Automation Manager interacts with the world 
through two channels. One uses a set of Internet services 
and, in particular, an HTTP server, to support the 
interaction with the users, local or remote. The interaction is 
based on standard browsers, HTML and Java technology. 


The second communication channel used by the Home 
Automation Manager supports the interaction with the 
control systems of the home. This is achieved through a 
software layer that dialogs with specific hardware interfaces 


to access the control systems. This software layer 
encapsulates the particularities of each system and offers a 
simple mechanism to control each home device and to get 
its current state. 


Internet 
Services 


Home 
Automation 
Manager 


Figure 2 - Internal Structure of the Home Server 


IV. SYSTEM CONCEPTUAL MODEL 


This section describes and explains the conceptual model 
used in the design of the Home Automation Manager. The 
main objectives of our approach are the following: 


1. An easy and generic way to configure a home 
automation system to a specific house, taking into 
account its physical particularities (space layout) and 
the type of functionality required (from which we can 
derive the number and type of devices to control); 

2. Interaction with one or more home control systems 
(each one, for example, covering a specific functional 
area such as lighting, air-conditioning, security, etc.) in 
a global way and independently of its manufacturer and 
technology used; 

3. Local and remote access to the home, for monitoring or 
control purposes, using a unique and global interface, 
based on a web browser and using Internet standard 
technologies; 

4. Simple and intuitive user interface, adequate to the 
needs of the home dwellers, even if they don't have 
specific knowledge on computers, by using a simple 
programming model to define the behavior of the home. 


The application adopts an object-oriented philosophy, in 
which every home automation device is represented by an 
object. This leads to an integrated view of the home, with a 
common look and feel across all the devices, while 
supporting various methods of actuation and sensing. 


Figure 3 depicts the conceptual model (simplified) of our 
approach, using UML — Unified Modeling Language [6]. 
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Figure 3 - Conceptual Model of the System 


This model is described and analyzed in the following 
sections. 


V. CONFIGURATION OF THE HOME SYSTEM 


There are three main classes that support the definition of 
the devices in the system. These will be used whenever a 
system is installed in a new home or changed in an existing 
installation: Device_Type, State_Type, and Argument_Type. 


The Device_Type class identifies the various types of 
devices (e.g., TV, lamp, air conditioner, and temperature 
sensor). The objects that represent devices are not instances 
of this class. Instead, they all belong to the same class, 
Device, that has an attribute of class Device_Type. The class 
Device includes the code and data common to all devices, 
such as the icon (the path of the file containing the bitmap 
representing the device on the screen). In an object-oriented 
model there should be a hierarchy of classes inheriting from 
Device, one for each type of device. But this is a static 
structure, that needs recompilation to create a new device. 
In our approach, the user can dynamically create new types 
of devices by merely describing its particular features, that 
is, filling in the attributes of Device_Type. These are, 
mainly: name, a unique ID, functional category (e.g., 
lighting, air conditioning, and security), and a state list that 
specifies the states in which each device (of this type) can 
be. All the instances of Device of the same device type (e.g., 
all lamps) share the same instance of Device Type. 


The state notion is fundamental in our approach and 
apparently is a step back in terms of object-oriented 
technology. A possible approach would be to characterize 
each device type with a set of attributes, each with a 
possible set of values. As an example, an air conditioner 
could have the following attributes: ON_OFF (boolean), 


VENTILATION_ONLY (boolean), FAN_SPEED (low, 
medium, high) and SET_TEMPERATURE (integer or 
float). The same would be done for all the types of devices. 
We think, however, that having all the characteristics at the 
top level is confusing to most non-expert users, and we 
turned to a simpler approach instead, in which each device 
has one single state at any moment. For example, a lamp can 
be ON or OFF, a video recorder can be OFF, PLAYING or 
RECORDING, and an air conditioner could be OFF, 
VENTILATING, HEATING or COOLING. This is easily 
grasped by anyone. It also highlights the states common to 
the various types of devices (such as OFF, PLAYING for 
VCRs and HiFis). Now this is too simple to specify 
everything, and the additional settings are expressed as 
arguments to each state. For example, the HEATING state 
of the air conditioner would have two arguments, 
FAN_SPEED and SET_TEMPERATURE. In the end, the 
same information has to be specified, but our approach has 
the advantage that it highlights the states (only one at the 
time) and relegates the detailed settings to a second level. 


Therefore, the class State_Type represents the states, which 
can be shared by devices of different types (e.g. PLAYING 
for VCRs, HiFis and DVDs). The main attributes of this 
class are: name, a unique ID and a list of arguments. The 
latter are described by class Argument_Type, whose main 
attributes are: name, type (integer, float, boolean, string), 
maximum and minimum values. 


It should be emphasized that classes Device_Type, 
State_Type, and Argument_Type are merely description 
classes. There are other classes, such as Device and State, to 
produce the actual objects that as a whole implement the 
behavior of the system. Many instances of device can share 
the same instance of Device_Type, without the need of 
redefining the set of possible states and their arguments. 


To simplify the organization of the various devices, we have 
introduced the class Category, which describes a functional 
area, such as lighting, air-conditioning, security, 
entertainment. Each device type belongs to one category 
only. The Device_Type class has an attribute of this class. 
Examples of device types in category entertainment: TV, 
VCR, HiFi and DVD. This functional view is important as a 
browsing aid and a means to perform global operations on a 
given category of device types. 


We also need to structure devices in respect to their 
physical location in the home (story, room, closet, and so 
on). We have introduced the class Location, which allows to 
specify that a given device is located in the kitchen, in the 
children bedroom, in the first floor's bathroom, in the 
basement, etc. The class Device includes an attribute of 
class Location. 
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VI. DEFINITION OF THE HOME SYSTEM BEHAVIOR 


The classes described so far support the definition of a 
given home system, in terms of its devices and their 
functional organization and physical location. The 
remaining classes pertain to the behavior of the system, the 
states of the devices and the programming of manual or 
automatic actions. 


At any moment, it is possible to know the state of the 
system by asking each device which state (instance of class 
State) it is in and the value of the respective arguments 
(instance of class Argument), if the state has them. Actually, 
there is a hierarchy of classes inheriting from Argument, to 
contemplate the various types of arguments. This has been 
ignored in the model for simplicity. 


In terms of behavior, the class Action specifies an individual 
operation that sets a given device in a given state (qualified 
by the eventual arguments). 


The Script class supports more complex behaviors. A script 
is a collection of actions and other scripts in the form of a 
program, with simple control structures, such as if and while 
statements. 


The execution of scripts (which can consist of a single 
action) is triggered by events. There are three types of 
events, described by classes User_Events, HW_Events and 
Time_Events. Each of these events specifies the script to 
execute when it is triggered. User events are triggered by 
the user by pressing buttons and selecting menus in the user 
interface. Hardware events happen as a result of a change in 
some hardware signal (such as the user pressing a light 
switch, a temperature sensor activating a line due to 
temperature increase above a given preset level, or a 
presence sensor detecting movement). Time events have a 
day and hour specified and are triggered when that time is 
reached. It is also possible to define repetitive time events 
(always at 12:00 on Mondays, for instance). Each type of 
event is managed by its own event manager class: 
User_Event_ Manager, HW_Event_Manager and 
Time_Event_Manager. 


The scripts have a high degree of flexibility and generality, 
but this is mainly useful for the experts to program basic 
functions. The non-expert user finds this complex and 
prefers to use the notion of scenario. This corresponds to 
setting a given collection of devices to a specific set of 
states. There is no specific class corresponding to this 
concept. It is simply implemented by a script without 
control statements, that is, a simple list of actions. Examples 
of typical scenarios are: do-not-disturb (silencing phones 
and door bells), no-one-at-home (set up the alarm system 
and disconnect all lights, air conditioners, etc), TV- 
watching-in-dinner (switch the TV on, set up the lights to a 
given level), sleep (set up the bedroom light to dim and to 
be switched off half an hour later, etc). It should be noted 


that a scenario does not have to specify all the devices at 
home! Only the relevant ones are specified. It is also 
possible, within a controlled simplicity, to use conditional 
scenarios, in which the actions to perform are conditioned 
by if statements. This allows to automatically adapt a 
scenario to the current environment, such as, for example, 
turn a lamp on or not according to whether it is night or day. 
Scenarios are set by any of the events described above, at 
the initiative of the user in the user interface, by a time 
event or as result of a hardware event (which allows to set a 
scenario by pressing a button on the wall or by pressing a 
key on an infrared remote control). 


VII. USER INTERFACE 


Due to lack of space, the user interface is not detailed here. 
It consists of a Java applet which has the following main 
features: 

1. Configuration of the system by creating new instances 
of Device_Type, State Type, Argument_Type and 
Location; 

2. Programming the system by defining events and actions 
at two levels: scenarios (simpler level) and scripts (full 
flexibility); 

3. Monitoring the system, offering two views: physical, in 
which the location of the devices becomes explicit, and 
functional, in which objects are organized according to 
their functional category. 


VIII. CONCLUSIONS 


This paper presented a home automation system that 
addresses the problems of configuration, interaction with 
multiple control systems and interface using a web browser. 
The model of the system was described. Deviations from a 
pure object-oriented model were made, justified by the 
increased simplicity to the user. The resulting system is 
layered in terms of complexity, hiding the complexity from 
the common user but allowing a more expert user to 
program the system with full flexibility. 
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Design of a web based interactive 
Computer lab course 
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Abstract-- The information age has created many challenges 
for the traditional established educational institutions. This 
paper describes the design of web based computer lab courses. 
The purpose of this effort was the support of the office 
automation courses and the preparation of a series of distance- 
learning courses. Also the integration of the WWW (and of the 
Internet in general) in the teaching process is forcing students 
to familiarize themselves with the use of this valuable tool. 


Index Terms-- web, distance learning, interactive, HTML, java 
script. 


I. Introduction 


The information age has created many challenges for 
the traditional established educational institutions. The 
Internet has been widely used by scientists for 
communication between each other and access to remote 
search centers. With the fast development of computer 
networks, people can access information and communicate 
with others without being constrained by space and time. 
Thus scientists are now able to discuss things with others, 
exchange ideas, in order to solve existing scientific 
problems. The most dominant form of computer-aided 
communication is the World Wide Web (WWW). 

In the last ten years, there has been considerable 
interest in the educational applications of electronic 
networking. The extensive literature presents numerous 
examples of the ways in which electronic networking can 
facilitate educational outcomes in these learning 
contexts.[1-4, 6]. The use of the WWW has been adopted 
into every aspect of the educational life [1]. One of the area 
in which WWW has made a great impact is distance 
learning [2]. It has offered the ability to teach students 
located in remote areas and students with disability 
problems. Many universities have started offering distance- 
learning courses. From the beginning of 1999 the Aristotle 
University of Thessaloniki has been involved in such 
courses. Information concerning this project (in greek only) 
can be found in http://distance.csd.auth.gr/. 

Based on the experienced obtained from the above 
project we have developed web based computer lab courses 
for the students of my department. The courses are focused 
on office automation applications. This paper describes the 
design of these web based computer lab courses. The 
purpose of this effort was the support of the office 
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automation courses and the preparation of a distance- 
learning course. Also the integration of the WWW (and of 
the Internet in general) in the courses is forcing students to 
familiarize themselves with the use of this valuable tool. 

The rest of the paper is organized as follows: Section 
II discuses the integration of www into the courses. Design 
issues are being considered in Section III. Exam and 
evaluation are presented in Section IV. Student’s reactions 
are included in Section V. Concluding remarks and future 
extensions of this work can be found in Section VI. 


II. Intergrading the WWW into the course 


Although Internet is a significant tool that combines 
delivery medium with instructional content and course 
management, effective use requires careful consideration. 
The academic community supports the use of the Web as a 
tool to enhance teaching and learning. But several 
considerations must be taken into account before 
intergrading the Web into the courses. Different levels of 
integration are appropriate for different situations. Another 
aspect is the level of the infrastructure technology available 
for the students. Bear in mind that the new technology 
might create new barriers the students. 

The successful integration of the WWW into a 
course depends on some basic requirements: 

e The students are able to connect to the Internet. They 
know how to use and have access to the necessary 
hardware. 

e The students can use a browser effectively. They can 


Fig. 1. Users connecting to the web-server. 
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find and retrieve information easily. An introductory 
course on web browser use may be required. 

e The Internet is a rapidly changing environment. 
Today’s links may not be active tomorrow. So frequent 
updates of offered links is necessary in order for a web 
site to be fully functional. 

The Internet has a moderate use in Greek 
community. The latest data on the number of greek Internet 
users estimates that around 4% of the population is using 
the new communication medium. But experts predict that 
by the end of 2000 10% of the population will have access 
to the Internet. The Greek students are able to access the 
Internet via the Greek University network (GUNET — 
http:/Awww.gunet.gr). 

Based on the above the method we have selected for 
intergrading WWW into the courses was the introduction of 
additional material for the course in the form of web pages. 
Instead of supplying students with pages (paper) we gave 
them the URL address of the html pages. The goal is to 
complement the learning material. The material in the web 
pages is updated regularly. The students were able to access 
the pages from the Department's computer lab or from their 
home computer by dial up connection to the university 
campus (figure 1). The above method is very suitable for 
teaching material that changes very often (like computer 
applications). The students can communicate with us via e- 
mail. Each week assignments and course preparation 
material were distributed via e-mail. 

Students found the use of a browser (in this case, 
Internet Explorer), as the medium for delivery, was 
generally satisfactory although there were some difficulties, 
which were related to the use of a network. In particular, 
interactive demonstration material was not included in the 
web pages because it was found to take too long to load. 
The same phenomenon was experienced with text-based 
materials, when the network was under heavy load. All the 
above problems were experienced when students were 
accessing the web pages via dial up connection. 


III. Web course site design 


A good web site must include some basic 
characteristics. It is not necessary to be a sophisticated web 
site. Even a simple web page with some links to other web 
pages may be adequate. The features that must be included 
are: 

e Course outline with links to details, presentation pages 
or data files that can be downloaded. This allows 
maximum flexibility for today's older or working 
student 

e Assignments (with fixed deadlines). They can be 
offered in the form of web pages or download files. 
Thus modem access to the web site enables students to 
take courses when and where desired. 


e E-mail link to the professor. The use of e-mail can 
increase student-faculty contact and make the contact 
time and place independent. 

Building a web site can be a pleasant or frustrating 
experience depending upon your experience, and the tools 
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links to various subjects 


e-mail to the instructor 


Fig. 2. The introductory page of the QuarkXpress 
course. 


you employ. Web pages are written in HTML, hypertext 
markup language, to encode them for transmission over the 
Internet. Although HTML language is not difficult to learn, 
various tools exist that let us create more easily web pages. 
The first choice is to employ a standard word processor 
application that offers the ability to save in HTML format. 
Word for Windows of Microsoft Inc is a typical example of 
such an application. The results of such an operation are 
moderate, but can be used as a first attempt to create a web 
course. A more professional approach is the use of web 
page editors, like FrontPage. These editors can help us 
create full range of web pages, manage the web site and 
include templates, and wizards features for rapid web page 
development. 

Until very recently, the typical web educational 
systems were largely non-interactive in nature offering 
static tutorial or educational material on a variety of topics. 
Currently some web pages with emphasis on the use of 
interactive components as fundamental parts of the learning 
environment are beginning to be experimented with [6]. In 
order to achieve such interaction we must include scripting 
with program languages such as Java. Web browser 
software decodes the HTML and scripts to display the page 
on the viewer’s screen. 

Some pages of our web course can be found in 
figures 2 and 3. The pages employ frames. The left frame 
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Fig. 3. A DTP exercise offered through the web 
course. 
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includes buttons that lead to various subjects of the course. 
An e-mail link to the instructor is also available in every 
page. More precisely figure 2 includes the introductory 
page of the QuarkXpress course and figure 3 presents a 
web-page with a DTP exercise that students must complete. 
These pages were developed with the help of FrontPage 
application. 

The problem with the pages that employ frames is 
that sometimes the user has difficulties to navigate through 
the web pages. The problem can be solved by using a java 
script that allows the creation of an hierarchical tree 
structure of the course material (fig. 4). The user can 
expand or collapse the tree branches in order to find the 
information he desires. This hierarchical structures can be 
placed in a frame located on the left side of the page, and 
the main information will appear on the main frame of the 


page. 


Menu 

indroduction 
Environment 
Working with text 
Connecting text boxe 
¥¥orking with images 
DTP exercises 


Fig. 4. Hierarchical tree structure created with a java script 


The use of java scripts, which are executed on the 
client machine, provides fast response (avoiding CGI 
technology). Basic knowledge on programming is adequate 
in order to make the necessary changes in the java script in 
order to implement the desire type of exercise. Suitable java 
scripts can be found on various web sites. 


Choose dre correct answer: 


1. What is the function of this button i. 


© @_ Saves our work fo the aiskete 
& b_ Saves our work te the hard disk 
© ¢_ Checks the integnty Eskette 

© d Saves our work 


2 We can copy text from one position to another by: 
© a. Pressing the Cut and Paste buttons 
& b. Seliecting text and pressing the paste bexton 


© ¢. Sellecting text and pressing the Copy and then the paste buttons 
~ d_ Selecting text and pressing the Cut and then the peste buttons 


You have found _ 
Fig. 5. An interactive test. 


IV. Exams and evaluation 


Exams aim to assess primarily the level of 
preparation reached by the students. Also they can be used 
to evaluate the effectiveness of the teaching methods 
adopted. In our case some knowledge test were added in the 


web pages, in order for the student to be able to directly 
assess his skills in a particularly course. 

The tests were developed with the help of Java. They 
include the following categories of tests: 


e  True/False 
e 1 outofN choices 
e MoutofN choices 


Images can also be included if it ts necessary. In 
figure 5 a part of typical test is presented. A “Results” 
button is also included in order for the student to immediate 
acquire his test results. The “New test” button kets the 
student begin the test again. 

Eventually these tests will be imtergraded to the 
weekly lab course. The student will have to prepare for the 
course by studying certain material at home. In the first 
hour of the lab course students will have to complete web 
tests and the results will be directly mailed to the mstructor. 
If the student passes the test he will be allowed to aftend the 
lab course. 

Another idea, which is still under development, is to 
assign to each student user name and password. Each 
student will be able to login to the server, access active 
server pages, and complete the same test, the results of 
which will be mailed directly to the imstructor. Thus the 
instructor will be able to monitor the studying time of each 
student. For security reasons this feature will be 
implemented only for users logging m from the 
department’s computer lab. Flexible time constramts may 
be added to insure that all students will access the same 
course material with roughly the same pace. 


V. Student's reactions and statistics 


The introduction of web pages into our lab courses 
was met with great enthusiasm from our students. 94% of 
the students used the www pages during the semester 
(figure 6). Most of them expressed the desire for other 
courses of the Department to implement www pages. Some 
other statistics were also interesting. The majority of the 
users accessed the web pages form Greece but we had also 
some foreign hits from U.K. and Norway. This 3s due to the 
fact that our department has also foreign students that were 
able to access the courses from their country. 84% of the 
computers that access our web page used 800X600 screen 
analysis. That explains why we have chosen to implement 
our pages in 800X600 analysis even if users with computers 


Fig. 6. Percentage of students accessing the web 
courses. 
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days 


Fig. 7. Average number of access of the web courses 
during the week 


with lower screen analysis (640X480) have some difficulty 
observing all the material available in one web page. 

In 7 we present the average number of access to our 
web pages during the days of the week. The peek values 
were exhibited during the days that the lab courses took 
place. It is worth noting that even on Saturday and Sunday 
some students access the web pages. 

Next we present in figure 8 the average number of 
access per hour in a 24 hours period. The peek hours were 
exhibited from a time period (12:00 to 17:00) during which 
students have more free time. Considerable access was also 
observed during the late night hours. The later in 
conjunction with the access during Saturday and Sunday 
indicates that students use the dial up service to access the 
web pages from home and study during the hours they 
desire. 
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Fig. 8. Average number of access per hour in a 24 hours time 
period. 


VI. Conclusions and future work 


Facilitated learning or independent learning courses 
(courses existing entirely or partially in electronic form) 
available in open-access laboratories can provide a high 
degree of flexibility for students who can come to the 


university but find it difficult to follow the rigid scheduling 
of classes. A key element in the above uses of academic 
computing is that professors retain contro] over content, 
standards, design, and assessment of student learning while 
giving up control of delivery of information. The role of 
teaching changes from information provider, to information 
guide. 

As modern computing environments and networking 
infrastructure arrives on the doorsteps of more and more of 
us, we are experiencing an increasing pressure to make 
effective use of this technology. University faculty 
members feel this pressure most, because the students, they 
deal with, are the most mature, and the technology in our 
workplaces tends to be of the most advanced type, and thus 
the opportunity and the benefits are greatest. Despite having 
motive and opportunity, the uptake on using information 
technology as an integral part of university education is 
sporadic, with some experiment and innovation, but little 
universal impact. 

Some special tools for designing, organizing and 
maintaining web courses have appeared in the literature (an 
extensive list can be found in http:// www.umr.edu /~rhall/ 
research /weblearning/ products.html). But most of them 
are special purpose applications that have been developed 
for specific needs and are difficult to be adopted, 
customized and learned by faculty members with basic 
computer skills. Thus we are currently involved in 
developing a software tool that will be based on well- 
known computer applications (like word-processors) that 
will allow educators with minimal computer knowledge to 
create web courses. 


References 


{1] D.R.MclIntyre and F.G.Wolf, "An experiment with 
WWW interactive learning in university education", 
Computers & Education 31 (1998), pp. 255-264. 

[2] M.Castro et al "Technology innovation and 
integration in distance education" in proc Melecon 
98 Vol. 1, pp. 164-168. 

[3] Glenn W. Rowe, Peter Gregor, " A computer based 
learning system for teaching computing: 
implementation and evaluation", Computers & 
Education 33 (1999), pp. 65-76. 

[4] C.D. Smith, H.E. Whiteley, S. Smith “Using email 
for teaching”, Computers & Education 33 (1999), 
pp. 15-25. 

[5] A.Veglis, "Communicating with Greek Newspaper 
via World Wide Web", IEEE  Grlobal 
Communication Newsletter, September 1999, pp.1- 
4. 

[6] Bradford C. Lister, “Interactive Distance Learning: 
The Virtual Studio Classroom”, Proceedings Third 
International IEEE Conference on Multimedia, 
Engineering and Education, Hong Kong, 1998. 


10 Mediterranean Electrotechnical Conference, MEleCon 2000, Vol. I 305 


306 


Wireless & Mobile Communications I 


Pee. 


0” Mediterranean Electrotechnical Conference, MEleCon 2000, Vol. I 


Management and Control of the Access Network 
Segment of Future Mobile Communications 
Systems by means of Neural Networks 


Ch.P.Sarantinopoulos, D.K.Karagiannis, K.Peppas, V.P.Demesticha, 
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Abstract. An important issue in future cellular communication 
systems is the design of the interconnecting network of their 
access segment and the best distribution of the systems’ 
capacity in order to provide the predefined Quality of Service 
(QoS). In this context the following topic arises; to find the 
minimum-cost configuration of Cell Site Switches (CSSs) given 
the Base Transceiver Station (BTS) layout, that satisfy a set of 
performance constraints (that derive from the traffic and 
mobility patterns in the system). An extended version of the 
problem may also comprise the deployment of the above 
configuration in different load configurations. The problem is 
formally defined, optimally formulated, and solved by using a 
neural network technique. Finally, results are provided and 
subsequent conclusions are drawn. 


I. INTRODUCTION 


Future mobile communications systems, e.g., UMTS ([1,2]), 
will have to provide a wide variety of sophisticated services, 
over the widest possible service area. The success of these 
systems will depend on the-Quality of Service (QoS) that 
they will provide, in the most cost efficient manner. In this 
paper we discuss about the design and control of the access 
network segment and the best distribution of the systems’ 
capacity in order to provide the predefined QoS. Our 
reference system is UMTS. However, the practices 
proposed hereafter may be equally applicable to any other 
mobile system of the future. 


The network elements in the access network segment are the 
Base Transceiver Stations (BTSs), which provide radio link 
management, the Ce// Site Switches (CSSs) and the Local 
Exchanges (LEs), which provide switching functionality, as 
well as connection and call control. 


The problem addressed in the context of this study is 
associated with the reallocation of the BTSs to CSSs, given 
the original distribution and the traffic load. The problem 
can be generally stated as follows: given the set of available 
network configuration, i.e. the B7S and CSS, the traffic load 
in each entity, find a new allocation of BTSs to CSSs, that 
maximizes the total carried traffic, by resulting to a load 
balanced network. Our approach in the paper regarding 
these problems may be summarized in the following. First, 
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the problems are formally stated and _ theoretically 
formulated. The problem is solved by using neural networks 
techniques. Finally, the neural network algorithm which was 
deployed is compared to a simulated annealing and a 
genetic algorithm. 


II. PROBLEM STATEMENT 
A. Network Reconfiguration Problem 


Let A. be the allocation of B7Ss to CSSs, that are 


established throughout the network at a certain point in 
time. This allocation designates the expected QoS levels in 
each BTS of the system. Traffic variations cause QoS 
degradations, and hence, a reconfiguration of the allocation 
is necessary. Through the reconfiguration mechanism a new 


allocation of BTSs A,(k) has to be imposed. This 


allocation should possess the following properties. First, it 
should be compliant with the problem constraints. Second, 
it should improve the cost function value C(k), that is for a 


certain set of traffic loads A,, A,_, and for certain 
capacity set g,, ¢,_,, the condition C(k) < C(k —1) 
should hold. Third, the already established allocation should 
be taken into account. That is, 4,(k) should be obtained 


by using all the already established CSSs. The overall 
problem statement has as follows. 


Problem: Given the network configuration, the traffic load 
in the different time-zones A, , the deployed CSS site C, 


the BTS layout V, the capacity thresholds ko, Qe; , of 
each CSS, and the costs of inter-connecting BTSs to CSSs 
Po= {Pacli sD eV,je c} , find an allocation of BTSs 
to CSSs A. (k), that minimizes the cost function 
ff (A. (k )) (associated with the cost of the equipment, and 
the interconnection and signaling cost), subject to the 


conditions g.(C) < ges; and Ic <he GV]. <ic.) 
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II]. OPTIMAL FORMULATION 


For the description of the allocation of BTSs to CSSs 
A.(k), we introduce the decision variables x, (i €V, 
j €C ) that take the value 1(0) depending on whether BTS- 
i is (is not) connected to CSS- 7 . Allocation A. (k) may 
be obtained by reduction to the following 0-1 linear 
programming problem. 


Problem: [Reconfiguring Network Design]: Minimize 


>, D. PoXs ©) 


te¥ jec 

Subject to, 

Dar =! , WieV (2) 
ony ake » WjeC (3) 
ich 

¥ x, PW < ec pe C (4) 


Cost function (1) penalizes the aspects identified in section 
2 (ie., cost of interconnecting the network elements 
deployed, and cost of handovers among BTSs controlled by 
different CSSs). Constraints (2) guarantee that each BTS will 
be assigned to one CSS. Constraints (3) guarantee that CSSs 
will not be assigned more BTSs than allowed by their 
capacity constraints. Thus, no more than & BTSs are 
allowed to be connected to a CSS. Constraints (4) guarantee 
that each CSS will not have to cope with more load than that 
dictated by its pertinent capacity constraint. Thus, the sum 
of the load of all BTSs which are connected to a CSS should 
be less than @.. 


ITV. COMPUTATIONALLY EFFICIENT SOLUTIONS 


This section provides computationally efficient solutions for 
the version of the network design problem addressed in this 
paper. The optimal formulation presented in the previous 
section yields that both the network configuration and 
reconfiguration problems fall within the NP-complete 
category. This means that an optimal algorithm may not be 
able to provide a solution in reasonable time. Furthermore, 
the optimal formulation comprises many variables. 

The usual next step for solving such difficult problems is to 
devise computationally efficient algorithms that may 
provide good solutions in reasonable time. As already 
stated, in this section we present a method based on the 
neural network theory and particular on the Hopfield neural 
network. Afterwards, this algorithm is compared with other 
two well-known methods. The first follows a simulated 
annealing strategy and the second is based on a genetic 
algorithm methodology. 


A. Neural Networks and optimization 


There are a lot of application problems which can be 
formulated as constrained optimization. Neural networks 
provide a suitable architecture for constrained optimization 
where constraints are represented in the global energy 
function of the network and implemented as links between 
the units of the network. The neural network evolution then 
performs gradient descent movements to reach a stable state 
of energy minima. If the energy minima is a global one, the 
optimization is successful, and the optimal solution has been 
found. 


B. Neural Networks for network reconfiguration 
problems 


The energy function and the constraints of the problem are 
presented at the equations (1), (2), (3) and (4). In order to 
solve this problem we can redefine it by mserting the 
constraints to the initial cost function and thus constructing 
a new energy function E. By minimizing this function we 
assure that the constraints are satisfied. 

In general, such an energy function can be written in the 
form of two terms: E = “cost” + “global constraints” 


In this way, the energy function for our problem is 
becoming: 


2 , 
£-4 DY es +8 D [Ds] +C-TA Ys, -k 
7e¥ pec teV \ peC jee ie¥ y 


+D-> Ys,-000-¢c] VieV, WeC 
ject i 
i720 
O,if x<0 


and A, B, C and D are parameters which denote the 
importance (weight) of each term. By minimizing the above 
energy function is the same as minimizing the first term and 
making the rest three terms equal to zero. But these three 
terms are equal to zero only when the constraimts are 
satisfied. 


(Equation 5) where @(x) = { 


In order to solve this problem we can use a neural network 
created by Hopfield and Tank{3], [5]. The solution you get 
may not be optimal in certain instances, but by and large 
you may get a solution close to the optimal in a short period 
of time. As said, we have given a BTS layout JJ = N BTSs 
and M CSSs at given also positions. We construct a neural 
network which consists of n x m neurons arranged in a two 
dimensional array of n neurons per row and m per column. 
The network is fully connected. The most important task is 
finding an appropriate weight connection matrix. It is 
constructed taking into account that non-valid solutions 
should be prevented and valid solutions should be preferred. 
One of the considerations in this regard is, for example, no 
two neurons in the same row should fire in the same cycle of 
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operation of the network and no more that k, neurons in the 
same column should fire in the same cycle. 


In this context, the Kronecker Delta function is used to 
facilitate simple notation. We refer to the neurons with two 
subscripts, one for the BTS it refers to, and the other for the 
CSS this BTS is connected to. Therefore, an element of the 
weight matrix for a connection between two neurons needs 
to have four subscripts, with a comma after two of the 
subscripts. An example is wi, referring to the weight on 
the connection between the (ik) neuron and the (lj) neuron. 
The value of this weight is set as follows: 


Wy y =—-A- 6 Pi Og — ~B-6,(1- 6,;) (6) 


The inputs to the network are chosen arbitrary. In general 
each neuron is described by two variables: its internal 
potential or activation a; and its output u;. The output u; is 
usually related to the internal potential a; by a non- 
decreasing monotonic activation function u,; = ¥(aj), 
typically the hyperbolic tangent. This activation function 
operates as a threshold function to limit output of each 
neuron to the interval 0 to 1. In general, the Hopfield analog 
neural network can be described by the system of 
differential equations 


B= |e, 220] ©) 

dt Ou, 

with u ah 1+ tank 2) G=1,2 n) (8) 
ii = ese 


1 
where yw, =—>0 
is the learning coefficient (reciprocal of the integration time 
constant T;). 


Gju; is the decay term with damping coefficient k; > 0 
ifs means the computational temperature 

Uj is the output of the i-th neuron 

Qj is the internal potential of the i-th neuron 


E(u) is the energy function 


The equation (8) ensures that the output u, will always range 
from 0 to 1 regardless of the value of the activation qj. 

The system of differential equations (7) and (8) has the 
following property: starting from any initial point in the 
hypercube (-1 <= u; <= 1) the dynamic system converges to 
a local minimum of the energy function E(u) located on a 
vertex or surface of the hypercube. 


Since an analog neural network converges to a solution very 
rapidly one can run the network many times starting from 
different initial conditions within a short period of time and 
eventually may find some very good solutions. Starting 
from different initial conditions the neural network 
described in Eqs. (8) and (9)may find various local minima, 
hopefully also a global minimum. In our case the derivative 


of the activation aj according to Equations (5) and (7) will 
be: 

da, f 

Stab. py +B->| x, -1|+C- ey Ly = ke (9) 


ieV \ jeC 

DSA Fx, 000-9] 

iV 
Shere t is the time constant of the amplifiers in the system. 
It can, without loss of generality, be set to 1.0. The output 
could be evaluated from the equation (8). 
Found that the time interval taken in the differential form of 
equation (9) should be a fraction of t, the time constant of 
the system. 
Thus the updating rule will be 


da, 
a,(t+1)=a, (+16 (10) 


In our case the above equation becomes: 


a, (C41) = a,()+{-E— a. p, +B: >[Z,-1 


jteV \ jeC 
nce i aes Jeo. TPs, (i) - }: a (11) 
jec ieV 1eV 


Using the above technique a program was constructed using 
the c programming language to implement the neural 
network algorithm. The algorithm can be graphically 


described from the figure 1: 
Find random initial input Find initial weight matrix 
ector for the neural network| using equation 6 


Find initial activations 
Use equation 11 
to find the new activations 
of each neuron 
Use equation 8 
to find the new outputs 
of each neuron 


If outputs are 
the same for the 
last N iterations 


Figure | 


The example we used in order to test the algorithm is a 9x9 
layout of BTSs. The load of each BTS is shown at the 
following figure. The maximum number of B7Ss that can be 
connected to a single CSS is 16, while the maximum total 
load that a CSS can handle is 140 (total load 820Erlangs). 
Using a greedy heuristic algorithm to solve the problem of 
placing CSSs with a minimum cost we found that the CSSs 
should be placed at the positions 15, 17, 23, 31, 33, 41, 47, 
67 and 73 (shown with bold in the figure). 
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Figure 2 


Then the above algorithm was used and found the 
connections that minimize the cost function while the 
constraints are satisfied. The total cost is 176, which is true 
the minimum cost. The connections of the BTSs to the CSSs 
are depicted at the following figure (figure 3). 


Figure 3 


The total load of each CSS and the number of connected 
BSSs are: 


Load (Erlangs BTS connected 


é 

100 6 
80 
a= 


Finally, we suppose that the load of the BTS layout is 
modified by reversing, for example, the load distribution. 
Thus, the load of the BSSs from outside to the center is now 
15->10->5->1Erlangs (see figure 2). After applying again 
the neural network proposed algorithm we found a new 
solution that can be seen in figure (4). 


| 
© OOOG0e 


Figure 4 


oo 


The total cost is now 188. But, even if the cost is not the 
minimum, the solution found is the optimum solution that 
satisfies the constraints, after the redistribution of the load. 


C. Other Algorithms 


In this section two well-known algorithms will be compared 
to the neural network algorithm. The simulated annealing 
technique and a genetic algorithm. 


Both methods can escape from local minima, though they 
need too much time to produce efficient solutions. On the 
contrary, the neural network approach tends to give 
solutions with less computing time than the other two. 

In general, neural networks would not reach the optimum 
solution, but with proper constraints, neural networks can 
find sub-optimal solutions in a very short period of time 
which are good enough for many applications. 


V. CONCLUSIONS 


This paper addressed the problem of designing and 
controlling the interconnecting network of the access 
segment of a cellular communications system, under the 
assumption that the underlying communication is based on 
ATM. The architecture of the interconnecting network was 
presented. Subsequently, the problem was formally stated, 
optimally formulated, and solved in a computationally 
efficient manner by means of a neural network algorithm. 
Finally, this algorithm was compared with two know 
algorithms: simulated annealing and genetic algorithm. 
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A Realistic Interference-Oriented Version of the 
Frequency Planning Problem for Future 
Wireless Telecommunication Systems 


S.A.Kotrotsos, P.P.Demestichas, E.C.Tzifa, V.P.Demesticha, M.E.Anagnostou 


Abstract—The cost effectiveness of cellular and broadband 
radio access systems is coupled with the efficient exploitation 
of the scarcely available radio spectrum. In this perspective, 
the frequency assignment problem will maintain its 
importance for several years to come. This paper elaborates on 
an interference-oriented version of the frequency assignment 
problem. The objective function is associated with the 
interference levels that are imposed by the frequency 
allocation, while the constraints. are related to the allocation of 
the frequencies required in each cell and the prevention of 
some unacceptable interference situations. In this paper, the 
problem is identified, described in detail, formally stated and 
formulated as an optimization problem. A computationally 
efficient heuristic algorithm that falls in the simulated 
annealing (SA) category is presented, which gave pretty 
satisfactory solutions in realistic test cases. 


Index Terms--GSM, UMTS, co-channel and adjacent channel 
interference, frequency assignment 


I. INTRODUCTION 


One could conceive that the cellular and radio access 

market of the coming years will be compound of - among 

others - the following systems: 

e Future versions of legacy cellular communications 
systems, e.g., those of GSM; 

e Third generation cellular systems as UMTS; 

e Broadband radio access and distribution systems like 
the ETSI BRAN (Broadband Radio Access Network). 

The cost-effectiveness of these systems is fundamentally 

bound to the efficient exploitation of the radio spectrum, a 

natural resource that has already been carried away from 

abundance. 


According to the current version of the specification the 
future versions of GSM and BRAN systems will use radio 
spectrum allocation techniques that are either based 
exclusively on FCA (Fixed Channel Allocation), or have 
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FCA-based components. Furthermore, it is anticipated that 
the interference conditions in the future radio environment 
will be quite complex. Therefore, versions of the frequency 
assignment (planning) problem, with objective the 
minimization of the interference levels caused by the 
assignment of frequencies to base stations, will maintain 
their importance in the future. 


In general, the frequency assignment (planning) problem 
may be described as follows: 

“Given the cell layout of the system, the set of available 
frequencies, the traffic-load prediction in each cell and the 
interference conditions in the network, find an allocation of 
frequencies to cells that optimizes an objective function and 
satisfies the problem constraints ”’. 


Under the term traffic-load prediction that is used above 
factors as service preferences and users mobility 
characteristics are comprised. Various objective functions 
and constraints have been considered in the literature [1]. A 
general classification scheme separates the versions of the 
problem into the interference oriented and the traffic 
oriented categories. Algorithms of the first category 
optimize the interference levels of a system, under 
constraints that derive from the traffic that should be carried 
in each cell and some interference conditions that should be 
avoided. Schemes of the second (and perhaps largest) 
allocate frequencies to cells in order to optimize a quality 
criterion associated with the traffic that is carried by the 
system, subject to interference constraints. 


Motivated by the anticipated complexity surge of the 
interference conditions of the environment, we present here 
an interference-oriented version of the problem. 


II. PROBLEM DESCRIPTION 


A first assumption regarding the inputs of the problem is 
that the expected traffic and mobility conditions that are 
encountered in each cell are translated, through appropriate 
pre-processing into the number of frequencies required in 
each cell. Théobjective of this step is to ensure that the 
QoS levels in each cell will exceed some pre-defined 
thresholds. The exact realization of this phase is beyond the 
scope of this paper, as it is methodologically independent 
from the rest of our approach. 
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The factors that are assumed t cominbut w the 

Interference-cost of the system are classified below. 

‘ Furst, the co-channel interference, which is cansed by 
the use of the same frequency im neighboring, cells. 


neighborme (or “near-by*) cells. 
. Third, extemal factors, which may be caused by other 
Operaings or other systems. 
The tem ‘neighborime” used here refers to imierference 
provocation level. a pattem not always obvious by the 
Spatial distnbution of the cells. 


> 


imitations, impess 2 minimam Ssperiion beiween 
frequencies that are assigned to a cell, named 2 combiner 
Separatiom constamt Finally, m order t accom fr 
extemal factors that contribute to the interference levels we 
allow cach cell tw have an associated set of forbidden 
frequencies - 


The soft co-chamnel imterference and the soft adjacent 


Chamnel imferference values provide the ultimate aspect of 


@ measure of the interference that is induced to cell i by 
the allocation to cell 7 of a frequency that is used in 7 _In 
puneiple, the level of this imterference is not prohibitively 
high: otherwise it can be represented by 2 hard co-chimn=! 
mferference constraint The soft adjacemt = chamne] 
imterference values provide a measure of the interference 
imdnced to cell i by the allocation t cell j of a frequency 
that is adjacent to one used in i 


We believe that the ahov aspects yield 2 realistic version 


problem version addressed in this paper is the following. 


Given 
(2) the set of cells in the system, 
(®) the set of available frequencies 


(c) the mumber of frequencies that are required in cach cell 


frequencies, 
(&) for each pair of cells the interference levels that will be 
indaced m the system if they acquire the seme frequency, 
find =n allocaton of frequencies to cells that optimizes 2 
levels m the sysiem, allocates the number of fequencies 
required im cach cell, and respects the comstramts (b)—(¢). 


TL ForMaL Prose STATEMENT 
The set of cells is demoted as V _ The set of availble 
frequencies 8 demoted as F _ Each cell requires 2 mmmber 
mobility conditions. This requirement is denoted as_r(i) 
(Vi <V) The combiner separation constraint of each coll 
is denoted as c(i) (Vi CV). The set of frequemcies that 
Should not be used im (ae forbidden for) cell FEF i 
denoted as FF(i) (FF(i) c F). 
In the sequel, we defime, for cach i €V , the set of cells 
CCli) (CO) <V ), witich contains the cells that are mot 
allowed t use 2 frequency also allocated to cell 7_ In 2 
Smuler manner, the set of cells that should mot ese 2 
frequency that is adjacent to 2 frequency that is allocated to 
cell 7, is denoted as AC{i) ( AC(i) CV). Heme, the 
variows CC{i) and AC(i) sets model the hard co-channel 
problem. 
The next aspect thet should be modeled iS the soft 
mierference valmes. To this ead, we define far every 
i,j) <V*_2 vale S.(;, j). xeyncscuting the co ued 
iterference that will be imposed m cell i by cell 7, if 
they are allocated the same fiequency. Similarly, the value 
S,(i.j) represeats the adjaccat homme] interference thet 
will be mmposed m cell i by cell 7, if they are allocated 
adjacem fregnencies. 
The aim of the Problem is @ find a allocation of 
frequencies t cells 4 = {Ai eV}. The set Afi) 
( Ali) < F ) comprises the frequencies that axe allocated wo 
cell z (7 €V). The aim of the allocation is to minimize a 
cost famction, denoted as C( A), which is relied & the 
(Soft) mmerference levels in the system. 
The comstraimts of the Problem address the followmg 
aspects. The first set of comstraints reflects that each cell 
Should be allocated as many frequencies as required. Hence. 
the condition | A(i)|= (i). for al i €V. showld bok 
The second set of constraims should mpese the 
Prescrvaiion of the RECSSSEy combmer separstion 


oe ee 


constraints. Therefore, for all i €V and for all the pairs 


(f,g) e(A(i) x A(i)) the condition If - g| > c(i) 
should hold. The third set of constraints should reflect that 
no cell should use a forbidden frequency. Hence, for all 


i EV thecondition A(i) \ FF(i) =@ should hold. 
The fourth set of constraints should ensure that the co- 
channel interference constraints are met. This means that 


for every i€V the condition A(i)A A(j) = if 


j €CC(i), should hold. The fourth set of constraints 
should ensure that the adjacent frequency interference 
constraints are met. This means that for every i EV , and 


for every f € A(i), the conditions ( i+ 1) ¢ Al j) and 
(f -1) € Aj), if 7 € AC(i), should hold. 


IV. MATHEMATICAL FORMULATION 
In order to describe in a more convenient manner the 
allocation A we define the decision variables Xj, having 


the value 1 if frequency f is allocated to cell i and 0 
otherwise. The problem of establishing the best allocation 
of frequencies to cells may be reduced to the following 
mathematical programming problem. 


Minimize 
A) = yoy [L.f)+ LS) 

Subject to: es 
Lif) =X SAii)-* 

Wie V, VP eF (2) 

LA) =ZSCI) ei +0] 

Hie V, Vf eF (3) 

Dieta y(?) 

fer 

Vi eV (4) 
If xy — g-%4|2 cli) 

VieV, V(f,g) €F? (5) 


al een Xb, ee v))= 0 


jeCC(i) jeAC(i) 


VieV,Vf EF (6) 
{0,1} 
VieV,Vf EF (7) 


A= {xl eV,f < F} 
(8) 


Relation (1) expresses the interference levels in the system, 
which are a function of the co-channel interference, 


ie (i, hi ) , and the adjacent frequency (channel) 


interference, I, (i he ) , that each cell 7 will sense on each 


allocated frequency f. The co-channel interference, 
1b (i yh ) , depends on whether other cells 7 , imposing soft 


co-channel interference on i (value S.(, j) ), use also 


frequency f . This is expressed by relation (2). Similarly, 
the adjacent channel interference, J, (i, f ), depends on 


whether other cells 7, imposing soft adjacent channel 


interference on i (value S, (i ; j) ), use frequency f —1 


or f +1. This is expressed by relation (3). Relation (4) 
ensures that each cell will be allocated as many frequencies 
as required. Relation (5) ensures that if two frequencies f , 


g are allocated to cell 7 the separation between them will 
be greater than, or equal to, the combiner separation. 
Relation (6) ensures that if cell 7 is allocated frequency 


f, then no cell j €CC(i) will be allocated frequency 
f, and no cell j € AC(i) will be allocated an adjacent 
frequency, namely f —1 or f +1. 


V. ALGORITHM BASED ON SIMULATED ANNEALING 


Annealing is the physical process in which a material 
transits from liquid to solid state. Gradual cooling brings 
some materials to crystal form, representing a minimum 
energy state. Modeling this process in order to solve an 
optimization problem is called Simulated Annealing. In this 
concept each solution is considered as a state, its cost as the 
energy of the state and the optimal solution as the state of 
minimum energy. During each phase of the algorithm, a 
new solution is generated by elementary alteration of the 
current best solution. If the new solution’s cost is lower, the 
new solution becomes the currently best solution. Solutions 
that increase the cost may also be accepted with probability 


e taal cry This is the fundamental mechanism of the 


algorithm so as to avoid blocking in local optima. CT is a 
control parameter, which may be perceived as the physical 
analogous of the temperature in the physical process. This 
parameter is decreased as the algorithm proceeds, according 
to the cooling schedule. The algorithm ends when either 
CT =0 or when a significant number of moves have been 
made without improving the cost function. 


The development of a simulated annealing based procedure 
requires the modeling of the following aspects: 
configuration space, cost function, neighborhood structure 
and cooling schedule (i.e., manner in which the temperature 
will be reduced). 

The following apply with respect to our problem. The 
configuration space is the set of feasible solutions 
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{x vl eV,feF (i.e., allocations of frequencies to 


cells) that satisfy the constraints (2)-(8). The cost function 
is the one introduced by relation (1). 


Altering the value of two decision variables Xi, and Xipy> 


while preserving the set of constraints (2) - (8) produces the 
neighborhood structure. The cell that will change frequency 
and the frequency that will be released is selected according 
to a random process that follows a uniform distribution. The 
new frequency assignment should not violate the 
interference constraints. 

There are various options regarding the cooling schedule. A 
recent advanced survey on this aspect may be found in [2]. 
The first (simpler and better known) scheme identified 
therein is the geometric cooling scheme. It follows the 


relations 72 = 7-Cr and Gf =f“ where’ CT “is 


new new? 
the temperature in a certain phase of the algorithm (current 
temperature) and 7 is a factor that usually ranges from 0.95 
to 0.99. The second scheme, which was used in this paper, 
is the, so-called, “re-heating as a function of the cost”. As 
stated in [2] the scheme was introduced in [3]. The re- 
heating scheme may increase the temperature of the 
algorithm, and therefore, prolong the mun-time of the 
algorithm. It is an advanced mechanism for escaping from 
local optima. Finally, in our paper re-heating was applied in 
conjunction with adaptive cooling, which the third cooling 
scheme identified in [2]. This is a sophisticated scheme for 
reducing the temperature of the algorithm, suitable for 
intensifying the search at certain areas of the solution space. 
Finally we adopted an additional empirical constraint: the 
maximum allowed interference cost that is provoked by a 
cell to the rest of the system. The latter prevented the 
system from producing inequalities in the interference 
distribution. 


The simulated annealing-based algorithm may be described 
as follows: 


Step 0. Initialisation - get an initial solution, JS, and an 
initial temperature value 7. The currently best solution 
(CBS) is IS, ie. NBS=IJS, and the current 
temperature value(CT) is T,ie., CT=T. 

Step 1. If CT =0; or if the stop criterion is satisfied, the 
procedure ends and a transition to step 6 is performed. 

Step 2. A new solution (NS) that is neighbouring to 
CBS is found. 

Step 3. The difference of the costs of the two solutions, 
CBS and NS is found, i., the quantity Ac= 
C(NS)-— C(CBS) is computed. 

Step 4. If Ac <0 then the new solution becomes the 
currently best solution, i., CBS = NS. Otherwise, if 


Ac = 0, then if a) Ps rand|0,1), the new solution 
becomes the currently best solution, ie., CBS = NS. 

Step 5. The cooling schedule, which is a combination of 
the re-heating as a function of cost, the adaptive and 
geometric cooling schemes, is applied, in order to calculate 


the new current temperature value CT , and a transition to 
step | is performed. 

Step 6. End. 

Various alternatives may be applied for realizing the stop 
criterion in step J. The algorithm may stop when no 
improvement has been made after a given number of 
temperature decreases, which may involve applications of 
the re-heating criterion. 


VI. RESULTS 


The SA version described above was put through extensive 
testing, giving quite promising results. Below we 
summarize the main characteristics of the test cases we used 
and the impressions we received. 


A first approach was to consider a 7x7 grid of hexagonal 
cells of equal size, assuming equally distributed traffic load. 
This could be translated to a fixed channel requirement for 
all cells, assumed to be 3. Hard constraints for adjacent 
channel were assumed to be the directly neighboring cells. 
The case for co-channel included also the cells distancing 
up to twice the cell radius from the base station (cell 
center). Considering a set of about 41 available frequencies 
an implementation of the algorithm gave an exceptional 
output plan achieving a balance in the average frequency 
reuse distance ranging form 6 to 9 times the cell radius. 


A second more realistic approach was to assume a non- 
uniformly distributed traffic. In this concept we considered 
the same 7x7 grid having a traffic hot spot in the center cell 
and the cells around it and low traffic in the rest of the cells. 
We considered channel requirement of 5 in the center, 4 and 
3 around it and 2 for the remaining cells. This concept 
modeled a city center or generally a high traffic region of an 
urban area. The algorithm proved to be quite effective for 
this case too, producing results with average frequency 
reuse distance of 5 to 9 times the cell radius. 


Finally we modeled Athens region using realistic traffic 
maps and existing cell shapes and characteristics. For a 
number of 500 cells and 50 frequencies we achieved a 
significantly improved planning lowering the interference 
costs, in comparison to the current situation. 
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Wireless Communication: Today’s need 
Tomorrow Promise 


ADEWUSI, Adebowale Ayodele, MJEEE 


Abstract-- Although Wireless technology is years old, it has 
recently caught the attention of the research community again. 
This paper examines the impact of economic and political 
problems on wireless systems. This in relation to the third 
world countries where either poor or nonexistent of telephony 
system has brought an obvious economic justification of the 
use of wireless communication. Wireless communication can 
bring the power of information to virtually everyone on the 
planet and can enable us to share experiences instantaneously. 


I. INTRODUCTION 


Over the decades, social values have become more and more 
complex, and the resulting problems have become so severe 
that we need more extensive, real-time solutions to address 
and resolve these highly volatile, real-time situations. The 
scientist and engineers of the past 25 years have witnessed the 
dawn of the global information age. A new era in the 
communication of information is upon us. We have thus 
entered into a world that is being transformed by information 
technology and communication in ways that may defy the 
periphery of today’s reality and even our imagination. 
Communication technology is exploding and is rapidly being 
put into use. 


We can perceive these changes in three different platform vis. 
Technical, economical and political. This paper will dwell on 
the last two platforms. This is mainly due to the fact that the 
economy and politics in the world are the main driving force 
to the implementation of whichever technology we decide to 
use. We have seen unlimited opportunity for application of 
information technology at large. It is facts that better 
management and efficient and proper use of information is the 
key to progress. The telecommunication industry is increasing 
globally and by estimation we should have close to 1 billion 
installed terminals by the end of this decade. Greatly desired 
are high-speed systems that will deliver interactive voice, text, 
data and video sources to businesses and homes. Greatly 
believe to be the cornerstone to prosperity and, as a vehicle for 
economic growth is telecommunication. 


I]. WIRELESS COMMUNICATION AND 
INFORMATION REVOLUTION: 


Communication has affected several aspects of national and 
international businesses. To some extent, the daily lives of 
people throughout the universe have benefited from _ it. 
Wireless is the use of radio technology to provide a fixed 
phone line both because the use of radio is less expensive than 
digging up roads and secondly because it allows for more 
rapid installations. Wireless is not the same as Cellular and 


cordless and it is worth noting the difference being that 
cellular radio provides access to a mobile terminal as opposed 
to a fixed terminal as in the case of wireless. Cordless is 
similar to cellular except that it typically designed for office 
and local area use. A radio path replaces the copper cable in 
the traditional method. As for conventional wired network, the 
cabling within the house is same except that the connection 
between the local switching station to the house is wireless. 
The only difference that the subscriber will notice is the small 
receiver on the side of the building. Wireless technology does 
not require wide bandwidth and can be at lower frequencies 
with the advantage of longer range and subscriber equipment 
that are less reduced in cost. Recently, wireless technology has 
been put into use in Nigeria with Lagos as the starting point. 
Over 10,000 lines are expected to be provided. However, the 
peculiar problem of electricity supply in that part of the 
country has had a tremendous effect on the success rate of the 
exercise. 


Il]. PROBLEMS AREAS 


The several problems that have either hindered the progress 
in wireless communication as well as areas to be given 
attention to are enumerated below: 

1. Lack of standardized radio spectrum and lack of 
standard technology 

2. A very strong challenge from other competing methods 
such as digital systems (TV and over copper), internet 
telephony, cellular and cordless and satellite 

3. Tariffs (do you know that international calls cost just a 
little more to make than local calls but it is very 
surprising that they are charged at a much higher rate? 
For example in Nigeria Local calls cost N3.15 ($0.032) 
per minute trunk calls costs between N12.00 ($0.123) 
and N35.00($0.36) and international calls costs 
between N550.00 ($5.67) and N925.00 ($9.54)) 

4. Regulations: Internationally, liberalization has become 
an increasingly powerful force in shaping 
telecommunication market and services. It has already 
transformed the nature of provision as obtained in the 
UK and USA. The control of private enterprise by any 
government could either be active or passive. 

5. Prices must come down. In Nigeria to acquire a land 
line used to cost N50,000.00 ($515.46) until recently 
when the new democratic elected government reduced 
it to N21,000.00 ($216.49). 

6. Political barriers 
1) In most developing and under developed countries, 

the political climate has been a major factor to the 
development and adoption of wireless communication. 
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A case of Nigeria where political instability coupled 
with economy problems has led to a monopoly in 
telecommunication industry. Recently, the government 
of Nigeria is trying its possible best to woo investor into 
the countries and seriously working on privatization of 
some sectors of the economy. We are expecting series of 
legislative changes that introduce a new concept 
entirely into telecommunication 

2) While many developed countries in North America 
and Europe are thinking or adopting wirelesses 
technology many other countries especially in Africa 
are still facing the problem of access to basic 
telecommunication infrastructure. The cost even 
installing telephony systems using the traditional copper 
technology is still more than an average citizen can 
afford than going for a costly wireless connection. 


Typical! national telecommunication needs are the provision of 
trunk telephony between cities. Thin route line in remote areas 
is also a major requirement. A solid and effective distribution 
network system and a combination of networks that are 
effective will go along way to increase the functionality’s and 
appreciation of communication. 


IV. RATIONALE FOR USAGE 


If the current efforts being put into research in Wireless 
technology leads to the desired results then it will be worth 
while to adopt this technology. The use of wireless 
communications in regions with either poor or nonexistent 
telephony provision has an obvious economic justification; it’s 
generally going to be much cheaper than copper. What is less 
obvious is that there is going to be a second, also potentially 
very large, market in regions where existing telephony 
provision is effectively at saturation levels and in developed 
countries where operators are seeking for means of providing 
a “better” alternative. In these instances the role of the 
wireless will be an instrument of telecommunication 
liberalization, allowing new entrants to compete effectively 
with the dominant established operator. 


The networks of the future are evolving gracefully from the 
exiting networks of today. The next fundamental step that is a 
key point will be to estadlish a good and perfect framework 
for tomorrow information network. This framework is 
already taking shape today based on the use of digital 
technology in the public network as opposed to the old 
switching system. The underlying problem of liberalisation 
management will remain as challenging as ever. 


Points to note here are the fact that wireless is less expensive to 


install than digging up roads, wireless technology takes less 
time 


V. EFFECTS /IMPACT OF WIRELESS 
COMMUNICATION 


A. ON USER 


Events around the world have shown to us that development is 
not simply economic growth but the sustainable improvement 
in living conditions for those in the greatest need. The 


information era is already having and would continue to have 
fundamental impacts on all aspects of life. Our entire societal 
system —judicial, political, educational, social, economic, and 
ideological perspective- and even world peace, war and 
stability are definitely being affected by the rapid changes in 
telecommunication technology. The speed at which the 
information revolution is changing the world is much greater 
than the geopolitical-economic changes caused by the 
industrial revolution. 


The major impact on the user has to do with costs. The 
promise at the initial stage was that wireless was inexpensive 
and would replace the cables that had previously carried 
traffic. Rather unfortunately, history and events have shown a 
real gap between promise and performance. The service 
obligations are not all the costs. One of the presumptions 
resulting from the implementation of telecommunication 
satellites that has come true is the dramatic reduction of cables 
installed. This is mostly due to several government regulations 
rather than economics. The rates of cables have gone down 
rapidly in costs, at a faster rate than Satellite tariffs when 
economical factors are considered. 


The impact of costs is also in personal communication 
application of wireless communication. The costs of providing 
ordinary voice communication are always most impossible to 
estimate directly since an end user has no choice whether his 
signal goes on or not. Several issues that are of concern to the 
users include the following: Political & Economic trends, 
Privacy, Reliability, Dedicated networks, Linkages to other 
systems, Control of contents. Others include unauthorized 
reception, unwanted reception, Private networks, Effect of 
private ownership, Confusion of public and private uses, and 
transponder issues. 


B. MARKET REQUIREMENTS 


Two phenomena that are already clichés but highly 
misunderstood: “the information age and the global village” 
are summarized by Satellite communications. The need to 
eliminate the barrier of distance that have impeded economic 
growth, social-service delivery, and public participation in 
rural and remote areas of industrialized, developing and 
under-developed countries are a great drive for wireless 
communications. At present, market for wireless 
communications in Nigeria is growing in number, although a 
bit fragmented, but extending and establishing them. As 
previously noted above, liberalization is becoming an 
increasingly powerful force in shaping telecommunications 
market and services. This has already transformed the nature 
of provision in the Europe, USA, and the Japan. Another main 
factor affecting market requirements is the need to increase 
rural resident’s access to information, raising agriculture 
productivity and promoting rural industries development. The 
impact of telecommunication on education, not only in 
decision, but in the home and workplace is also affecting the 
market requirements. 


The major effects and impact as seen of the market 
requirements are: 


1. Changes in telecommunication technology 
Changes in telecommunication regulatory policy 
Changes in demand for telecommunication services 
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4. Industrial, technological, and cultural policies 
5. Globalization 
6. Reliability 


These six effects can be viewed under three perspectives: 

1. 1. Monitoring continuously the various developments 
and wireless communication system’s development, 
adjusting and implementing the changes as necessary 

2. 2. Defining a clear set of overall aims and objectives 
for the wireless communication system, which are 
viable for the system 

3. 3. Determining and organizing in detail the essential 
parts of the market forces and their subsequent 
integration /implementation to achieve the overall aims 
and objectives set above 


With increasing competition for market place, service 
providers have to tailor services more closely to the needs of 
customers. The constraint of competition must be understood 
and dealt with. In information technology and 
telecommunications, standards are important at all levels. The 
need to remove the traditional information 
system/telecommunications cultural divide, both technically 
and at the services and applications levels is making 
telecommunication to cause a global village. 


VI. FINAL WORDS 


Good communications, both between the African environment 
and the rest of the world for administrative, public and 
emergency use, could bring great benefits. In time, wireless 
communication would remain a viable extension of the cabled 
network for overall purposes on the extremely rural side. 
Wireless communication has some growing yet to-do, but if the 
barriers of regulations and monopoly are successfully dissolved, 
it may be about to enter its second golden age. Thus to be 
continuously successful, the telecommunications industry needs 
to always consider the full picture, from pricing to support of 
communication. However, the underlying problem of 
liberalisation management -- balancing the desire for 
competition with a continuing requirement for protection- 
remains as challenging as ever. Smaller and cheaper earth 
stations, many of them solar powered coupled with new and 
improved radio and network technologies have made it possible 
to reach virtually every human settlement. 


0) ADEWUSI, B.A.: Internetworking: Gateway for enhanced 
Productivity, Workshop Proceeding of National Workshop of Nigeria 
Internet Group, September 1995, pp. 132-136. 

(2] BEIABURE, M., HERON, B., and LEAK, M.,: Communications 
in Kiribati, IEE Review, May 1991, pp.179-183. 

{3] Dettmer, R.: Where there is a WILL, IEE Review, July 1995, 
pp. 145-148. 

[4] Hart, C.: “Fixed Wireless access: a market and system overview” 
Electron. Commun. Eng. J. October 1998, 10, (5), pp.213-220. 


10" Mediterranean Electrotechnical Conference, MEleCon 2000, Vol. I 


51 19/ 


A Direct Conversion Receiver Analysis 
for Multistandard Wireless Applications 


L. Dermentzoglou Member IEEE, A. Arapoyanni Member IEEE, A. Pneumatikakis 


Abstract— This paper presents a direct conversion receiver 
modified for tri-band operation (GSM900/ GSM1800/ 
PCS1900). The IF stage in zero frequency, splits into I and Q 
branches while offset cancellation takes place in digital 
domain and is not included in the objectives of this paper. 
The overall receiver performance is evaluated in terms of 
Noise Figure, Intermodulation, blocking requirements, SNR, 
and C/L Multimode operation is ensured by the fulfillment of 
the most demanding system specifications among the three 
participants. The complete system analysis that was carried 
out using HP-ADS, subjects the receiver into four different 
sets of tests and the main objective is the extraction of 
feasible building block specifications in order to guide a 
silicon design process. 


Index terms—Direct conversion, adjacent channel, 
intermodulation, interfering, SNR, C/L 


If INTRODUCTION 


The evolution in the field of wireless communications has 
imposed some new standards concerning the receiver and 
transmitter design approaches. The growing need for fully 
integrability, and low power operation is leading in the 
transfer of primary functions of the receiver into the 
baseband. 


Additionally the globalization in telecommunications 
makes the demand for standards operating almost 
everywhere in the world more critical than ever. Until the 
use of a universal standard is agreed, conventional 
wireless standards like GSM900, GSM1800, and PCS1900 
will be the path for what is said a “world cellular phone”. 


Receiver planning, especially in multimode operation is 
very often the essence of a large amount of considerations, 
leading to an even greater amount of tradeoffs. Such 
tradeoffs include gain distribution among the various 
building blocks of the receiver, linearity and noise 
performance and power consumption, in order to achieve 
the optimum GMSK signal handling according to system 
specifications as they are defined in standardization 
documents [1]. The work reported here presents the 
simulation results of non-ideal building block models that 
provide optimal and feasible hardware specifications that 
are compatible to the system requirements. These 
specifications ensure the reliable performance essential for 
a first silicon run. 


This paper is divided in five sections: In section II the 
system requirements and test procedure is presented while 
in section III the receiver architecture and its main 
principles are analyzed. Then in section IV the simulation 


results leading to optimized building block specifications 
are given and finally the conclusions and feasibility results 
are granted. 


IL. SYSTEM REQUIREMENTS AND TESTS 


The guarantee for a multistandard operation is the 
fulfillment of system specifications provided in 
standardization documents. In this case, the direct 
conversion receiver under test, is ought to operate properly 
according to GSM900, GSM1800, PCS1900 system 
requirements. This proper operation is determined 
achieving successful results in a number of tests. The 
success is confirmed meeting the goals given in table 1. 


Table1: Performance Requa 
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The required tests in which the receiver has to be subjected 
to, include first of all the blocking or desensitization 
ability of the receiver. This determines the performance of 
the receiver under strong unmodulated in band and out of 
band blockers. 


The adjacent channel rejection is a factor of the receiver, 
which grants the proper handling of the wanted signal 
without exceeding a given degradation due to the existence 
of neighboring unwanted modulated signals. 


The intermodulation response rejection mainly determines 
the linearity requirements. This is done by maintaining C/I 
well above 12 dB for all the incorporated standards 
considering two unwanted signals, a continuous sinewave 
fi and a modulated tone /2, both at a power level of —-43 
dBm for GSM900, and -49 dBm for GSM1800 and 


PCS1900. The above-mentioned signals are related by the 
formula: 


f2-f1|=800 kHz (1) 


Finally noise analysis determines the performance of the 
behavioral models in terms of the added noise to the 
system due to their imperfect operation. 
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Ill. THE DIRECT CONVERSION RECEIVER 


The principle of the direct conversion receiver that was 
chosen for the multimode operation is presented in figure 


Detector 


——_—_ FF section a oo Baseband Section ——————_> 


Figure 1: A direct conversion receiver oriented in 
multimode GMSK signal handling 


In this approach one of the major concerns was to keep as 
much hardware as possible common for the operating 
standards. For this reason the front end was chosen to meet 
the individual specifications in order to take advantage of 
filters selectivity or low noise amplifiers reverse isolation, 
necessary for the attenuation of the local oscillator leakage 
signal. 


In details, several tradeoffs have to be taken into account 
in order to determine the performance of each building 
block of the receiver. One of these concerns is the gain of 
both the LNA and mixer structure [2]. Although, a large 
gain should be convenient for compensating the overall 
noise figure of the receiver, in the same time this gain may 
overload the forthcoming structures limiting the overall 
dynamic range. In cases where highly linear IF amplifier 
blocks are not feasible, it is wise to use adjustable gain low 
noise amplifiers. 


COS@zot 


(b) 
Figure 2: Self mixing of (a) Interferers (6) LO 


Another concern has to do with the offset voltages 
produced during the down conversion to baseband. These 
dc signals may corrupt the wanted signal and more often 
lead the following stages to saturation [3]. This 
phenomenon which appears due to the imperfect isolation 
between the LO port and the inputs of the mixer and the 
LNA is best described in figure 2. 


The problem of offset cancellation is a very complicated 
one, directly connected to software algorithms and is not 
included in the objectives of this paper. In general, a dc 
correction algorithm is based on averaging the incoming 
corrupted signal over a relatively long period, and the 
result is periodically subtracted from the signal [4]. 
Substraction of two to three times per second is considered 
to be reliable. Figure 3 presents in a more detailed version 
the time variant offset cancellation concept [5]. 


Rae Averaging 
Circuit 


Figure 3; DC offset digital stage canceller 


Finally, the direct conversion receiver incorporates 
quadrature down-conversion. The I and Q branches of the 
receiver carry different types of data and it is crucial that 
experience the minimum phase and gain imbalances, in 
order to impose the less possible error to the signal 
constellation. It has been proven that phase mismatch less 
than 5° and gain imbalances around 1 dB degrade the SNR 
of the signal by roughly 1 dB. 


IV. SIMULATION RESULTS 


Simulation has been used for the optimization of the direct 
conversion receiver chain. The result of this procedure is 
the extraction of feasible specifications for the hardware of 
the analog part in order to ensure the reliable operation of 
the receiver, dictated by the system specifications of the 
standardization documents. Additionally, since the digital 
part has not been incorporated in the simulation, some 
extra attention has been paid so that the digital part won’t 
suffer due to complexity of the FIR filters. In table 2, the 
resulting performance of the receiver is summarized. 


In figure 4 is presented the ability of the receiver to reject 
an adjacent channel interferer of -33 dBm located at 200 
kHz offset from the -82 dBm GMSK useful signal, 
preserving the C/I and SNR criterion. 


10" Mediterranean Electrotechnical Conference, MEleCon 2000, Vol. I 319 


INPUT SIGNAL IF ap litior 


: Adlocent 
ag=d. 


40 . 20OOCOMHz 


2=-73. : 
an{DCRC2 1.24.2: =200... a 293383. 
Adjacent 


Pie=73 
Ca 


OcR_C2I pele ~g-oet_ D3. e+sTss : 


~1ee 
i Ad|ecent = 

i Plne—73_e00080 
SSm(OCR_C2i_pe]_200__ g_est ise $30730 


ousK 

fr eq=940. OOOOOOMH= 

Pi n=-73.000000_ 
.SREMERER_CA 1a L_2QR. 


Power (dam) 
Power (dim) 


-2 


as ee vs us ze 2 
Frecuency (MHz) Ficamency, canst Fi 


Duplexer : gure 4: Adjacent channel rejection simulation results 


Bes ee As was mentioned above, a key issue is the maximum 
a a integrability of the receiver. Since integrated bandselect 
 Aategeet filters located at the antenna face the bottleneck of limited 
dynamic range or rejection, the use of discrete RF filters 
seemed unavoidable [6]. Due to this fact, no optimization 
of their performance is attempted and their frequency 


response is presented in figure 5. 
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The low-pass filter placed in the baseband is a standard 
= GMSK filter with BT=0.3. The purpose of this filter is to 
ates eS partially select the wanted channel providing 60 dB of 
ae 2 a ee. attenuation of the overall amount of 88 dB required by the 
. system. The filters incorporated in the digital domain 

ie aS . ae provide the remaining rejection. Additionally this filter 
eae fulfills the digital image rejection requirements in order to 
perform a successful quantization of the analog signal. The 

frequency response of the IF low-pass filter is presented in 

Erecesecy <incif figure 6, and its order is calculated to be 6. 
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Figure 6: Frequency response of the channel select, IF, 
low-pass filter. 


The specifications of the various building blocks are 
presented in table 3. According to these the IIP3 of -4 
dBm guarantees that the C/I remains above 12 dB 
although this figure imposes 0.15 dB of compression in the 
LNA engaged in the GSM900 amplifying operation. The 
GSM1800 RF amplifier has a different set of 
specifications since the system blocking requirements are 
less strict compared to those of GSM900/PCS 1900. 


Table 3: Direct Receiver Analog Hardware Specifications 


Parameter 


Operating Frequency 


Phase Mismatch [| 5 | deg 


-85@+200 kHz 
-110@+400 kHz 
-127@+600 kHz 


-135@f-fo<1.6MHz 


According to the above tables is obvious that the direct 
conversion receiver demands excessive spectral purity in 
the VCO output, compared to conventional receiver 
architectures, such as the low IF or super-heterodyne 
topologies [7]. 


Finally, the dynamic range requirements of the ADC are 
set on the high-end by the maximum signa! !:vel presented 
at the ADC input and on the low-end by the amplified 
minimum signal in combination to the bit resolution of the 
ADC. For the direct conversion receiver an 8-bit ADC was 
selected with a full-scale operation at 0 dBm. 


V. CONCLUSIONS 


In this paper, a direct conversion receiver architecture 
trimmed for multimode operation according to GMSK 
standards is presented. Simulation has led to the derivation 
of optimum hardware specifications. These specifications, 
which ensure for proper operation of the receiver 
according to the standards, and are feasible in terms of 
integration, are extracted in order to drive a BiCMOS 
realization process. 
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Designing and Prototyping a Wireless 
Point-to-Point Link with Multimedia 
Application Support 


Andreas G. Savvides, IEEE Student Member 


Abstract-- We present a wireless point-to-point link prototype 
that provides multimedia application support. A point-to-point 
link is the most fundamental form of computer 
communication that is used to interconnect several 
components of a network ranging from connections between a 
pair of desktop computers to connections between local area 
networks. Supporting multimedia on a wireless point-to-point 
link can enable the use of multimedia across wired and 
wireless QoS LANs connected with such links thus providing a 
wider range of QoS coverage. We describe an adaptive 
medium access control (MAC) protocol that makes bandwidth 
considerations for such multimedia applications. To 
demonstrate our idea, we have constructed a prototype that 
can be used with existing off the shelf applications. In this 
paper we present the prototype architecture and we report on 
our initial results and experiences. 


Index Terms-- Wireless Testbed, Point-to-Point Links, 
Prototype, Multimedia Support 


I. INTRODUCTION 


Wireless local area networks are under rapid deployment in 
the past few years. Different types of wireless portable 
devices find applications in many different settings ranging 
from hospitals, airport terminals, conferences, stock 
exchanges to home networking applications. Besides the 
delivery of traditional datagram traffic, new wireless 
terminals are also expected to deliver time sensitive 
information such as audio and video. Example applications 
for this are teleconferencing and videoconferencing. To 
sustain such applications, some minimal Quality of 
Service(QoS) requirements must be met. The most common 
form of these requirements can be expressed in terms of the 
minimum bandwidth and the maximum tolerable delay . In 
a wireless setting, such requirements present a greater 
challenge than their wired counterparts, because of the 
unpredictability of wireless links, and the scarcity of 
bandwidth resources. Furthermore, the broadcast nature of 
wireless channels requires the use of specialized medium 
access control (MAC) protocols that would guarantee the 
delivery of information and make efficient use of the 
wireless channel. 


Department of Electrical and Computer Engineering 
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In this paper, we examine multimedia support on a 
wireless point-to-point link. A point-to-point link is the 
most fundamental method of computer communication. It 
can be used to connect two desktop computers or to connect 
entire local area networks (LANs). Possible uses include the 
connection of LANs in adjacent buildings in a campus 
setting or connectivity support between a mobile station 
and a fixed LAN. The proposed algorithm can be used in 
conjunction with the IEEE 802.11 MAC [3] for wireless 
LANs or it can be used as a standalone protocol. In its 
default mode the IEEE 802.11 MAC protocol does not 
make provisions for multimedia support. It is, however, 
possible to implement multimedia support in the Point 
Coordination Function (PCF), which can coexist with the 
default Distributed Coordination Function (DCF). In such 
case, each node will need to register with the polling 
coordination function during the contention period, and 
transmit the data during the subsequent contention free 
period. Examples of other protocols PCF based protocols 
can be found in [4] and [5]. Here we present an adaptive 
polling based algorithm that provides multimedia support in 
a testbed prototype that we have developed. We focus on 
the most basic form of the algorithm for the purposes of 
describing our prototype implementation, Further 
optimizations are also possible and they are currently under 
investigation and will be included in future publications. 
We begin our discussion with a description of the wireless 
point-to-point link protocol in the next section. In section 
Ill, provides a description our prototype architecture. 
Section IV discusses the results and experiences from 
constructing this prototype. Section V discusses the lessons 
learned and section VI concludes the paper. 


II. POINT-TO-POINT ALGORITHM 


To achieve high utilization, wireless channels must be 
managed effectively at the MAC layer. Since the MAC 
layer controls all the packet transmissions on the wireless 
channel and in most of the cases it is responsible for 
handling collisions and retransmissions. In the proposed 
MAC protocol, a polling based mechanism is used to 
achieve collision avoidance and yield high utilization. The 
polling mechanism can allocate the required bandwidth 
resources to each application by adjusting its polling 
frequency across the various applications. To compensate 
for any changes in the channel conditions, and to handle 
bursts effectively, an adaptive mechanism is also employed 
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that makes the required adjustments on the polling 
sequence. 


A. Setup and Assumptions 


The setup we investigate consists of two nodes one of 
which is dedicated to the arbiter and the other the client. 
The arbiter is responsible for generating the polling 
sequence and the client is driven by the arbiter in the sense 
that it can only transmit packets when polled by the arbiter. 
At each node, traffic is classified into two sessions, a real- 
time session (RT) and a non-real-time session (NRT). A 
classifier examines the packets and buffers them into two 
distinct queues, one for each session. As implied by the 
labels, the RT session consists of the traffic originating 
from multimedia applications and has certain bandwidth 
and delay requirements. The NRT session consists of all 
other traffic generated by the node. We further assume that 
the RT traffic comes from one or more multimedia 
applications that have the same requirements that are 
known at the MAC layer. If each RT application has 
different requirements a session for each of such application 
can be allocated. For simplicity purposes, we assume that 
each station has only one RT and one NRT session each 
consisting of a single application. With this assumption 
admission control is also not required. It also implies that 
enough bandwidth is available to cover the needs of the RT 
applications and some bandwidth is left over for control 
signaling and to transmit some NRT packets. 

Polling is done by transmitting a permission to 
send (PTS) packet. Thispacket is typed as PTS/RT or 
PTS/NRT for the RT and NRT sessions respectively. A 
node can respond to a PTS message with a DATA packet or 
a nothing-to-send (NTS) packet. All the received packets 
are also acknowledged by an ACK packet. Obviously, 
arbiter polling is done internally so no packets are sent on 
the wireless channel, when the arbiter polls itself. 


B. Protocol Details 


The arbiter node generates a polling sequence by 
maintaining a set of state variables for each of the 
participating sessions.’ These variables are: the polling state 
limit (PSL), the polling state variable (PSV), and the polling 
state counter (PSC). PSL is the maximum number of cycles 
that a session can remain idle without being polled. PSV 
denotes the polling frequency for a session; PSV = 0 means 
that the session is polled in every cycle, PSV = 1 means that 
its is polled every other cycle and so on. PSC is a counter 
for the number of cycles remaining until the next poll for a 
session. Initially, and after each time the session is polled, 
the value of PSV is set to PSV. A cycle is one full iteration 
of the polling sequence. The basic algorithm of for the 
arbiter in given in figure 1. 

On the client side, the packet headers for the 
DATA packets include the queue sizes for the RT and NRT 
queues. By considering the queues sizes and the number of 
retransmissions, the adaptive mechanism can adjust the 
polling sequence to adapt to the new requirements. By 
doing so, it can compensate for changes in the channel state 
and handle possible packet bursts 


//Nabel the sessions as 

Client RT session <0 
Arbiter RT session < 1 
Client NRT session < 2 
Arbiter NRT session < 3 


arbiter() 
begin 
DONE < FALSE 
while (DONE if FALSE ) 
begin 
CurrentSession = ( CurrentSession + 1 ) mod 4; 
if (PSC [ CurrentSession ] = 0 ) 
begin 
if Client RT 
Poll Client for RT Packet 
SetTimer 
if Arbiter RT 
Poll arbiter for RT Packet 
if Client NRT 
Poll client for NRT Packet 
SetTimer 
if Arbiter NRT 
Poll arbiter for NRT Packet 
PSC [CurrentSession ]=PSV [CurrentSession] 
DONE=TRUE 
end 
else 
begin 
PSC [ CurrentSession ]--; 
end 
end 
end 


Figure 1 Basic Arbiter Algorithm 


Also if a session is polled and it has nothing to transmit, the 
protocol will check whether the other session on the same 
node has something to transmit, if not then an NTS packet 
is transmitted and this change is taken into consideration at 
the adaptive algorithm. In the next section we provide the 
details of our prototype construction and testbed setup. 


III. PROTOTYPE ARCHITECTURE 


To evaluate the algorithm described in the previous section 
we implemented a prototype platform that using off the 
shelf components. Our prototype has a dual purpose. First, 
to allow us to test and demonstrate our algorithm with 
existing real applications. Second, to assess the feasibility 
of implementing this and other similar algorithms on the 
platforms used here. Since at the time of the development of 
this platform no formal development tools were available, 
this feasibility test can provide useful appreciation of the 
platform capabilities. This prototype can be used as a 
starting point to jumpstart more elaborate implementations. 
Our platform is made up of Pentium based PCs running 
Windows 95 and equipped with the PRISM I wireless PC 
cards from Harris Semiconductor[6]. These wireless cards 
feature the PRISM radio chipset and the PcNet Mobile 
MAC controller from Advanced Micro Devices[7]. This 
platform was chosen because it is reprogrammable. New 
MAC protocols can be installed on the wireless network 
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interface card (NIC) by downloading the appropriate 
firmware to the on board flash memory. The commercial 
version of the PRISM I PC card ships with an 
implementation of the IEEE 802.11 MAC protocol. For the 
purposes of our project, we have replaced the commercial 
firmware with the firmware we have implemented. To be 
able to use the new firmware we have also implemented the 
corresponding device drivers for widows. This device 
driver was developed as a miniport inside the Microsoft 
Network Device Interface Specification (NDIS). As it will 
become obvious later on, the device driver is used as a tool 
to reduce the development complexity of the firmware. The 
details of our development are described in the next 
subsections 


4. MAC Development System 


To allow flexibility and further experimentation we first 
constructed a MAC development system that provides a 
basic framework for the development of experimental MAC 
protocols. Our design goal was to provide a highly 
programmable platform for rapid protocol prototyping and 
performance evaluation of new MAC protocols. Our 
approach aims to reduce the complexity of the firmware by 
moving the complexity of the MAC development in the 
device driver. The new firmware developed implements a 
simple receive/transmit (RX/TX) functionality. When a 
packet arrives from the network it is delivered directly to 
the device driver and when the device driver decides to 
transmit the packet, it passes the packet to the firmware that 
transmits the packet. The device driver has complete control 
of both descending and ascending traffic flows thus it can 
have complete control over the media. To facilitate protocol 
development and packet handling in the device driver, we 
have developed a basic API that reduces the platform 
depended programming requirements for developing new 
protocols. A detailed description of the development system 
is provided in [1]. 


B. Multimedia Support Implementation 


Using the development system described in the previous 
section we implement the algorithm from section II. Since 
the framework is already implemented, at this point we only 
need to add the components specific to this implementation. 
The modules added are a MAC classifier, a pair of packet 
queues for buffering the RT and NRT packets, the polling 
response module and in the arbiter node we also add the 
arbiter adaptive mechanism. The positioning of the modules 
in the arbiter and client nodes is shown in figure 2. All 
descending packets are delivered to the device driver for 
transmission on the wireless channel. Upon entry in the 
device driver, packets are differentiated into the two main 
supported sessions, RT and NRT by the MAC classifier. 
The MAC classifier examines the source and destination 
port numbers in the IP header and inserts the packet into the 
appropriate queue. Once a node receives a poll then the 
polling response module checks the appropriate queue for 
packets. If a packet is available it , transmits the packet, 
otherwise it checks if any packets are available in the other 


queue on the node. If the other queue has a packet, it is 
transmitted, otherwise a NTS packet is transmitted. 


5 


Figure 2 Multimedia Support Implementation 


IV. TESTBED MEASUREMENTS 


To verify our prototype we have constructed a testbed 
consisting of an arbiter and a client node. Also, to assist 
performance evaluation, a monitor node has been added. 
The monitor node does not participate in the 
communication between the 2 nodes. It is merely a passive 
observer that records the traffic that flows through the 
wireless channel. We have developed this as a debugging 
aid for our protocols and to take measurements of the 
protocol performance. The monitor node is running a DOS 
based application we have developed that collects the 
packets that are transmitted on the channel. The results 
presented in this section were obtained by analyzing the 
traffic traces collected by the monitor node. 

In our experiments we used an FTP application to 
generate the NRT traffic and a mp3 audio application for 
the RT traffic. The FTP application was chosen because of 
the aggressiveness. When started, the FTP application will 
attempt to take up as much bandwidth as possible. We use 
an audio application for the RT traffic because it is easy to 
observe any degradation in the quality of the audio if the 
bandwidth requirements are not met. The RT and NRT 
applications require an HTTP and a FTP server 
respectively. In our setup both these servers are running on 
the client node with their corresponding client applications 
running on the arbiter node. 

To demonstrate our prototype we performed three 
types of experiments. The first set of experiments are the 
control experiments, in which we record the traces of the 
NRT and RT applications when only one of them is running 
on the system thus without any competition for bandwidth 
resources. The results are shown in figures 3 and 4 for the 
NRT and RT respectively. In the second and third 
experiments, we start the FTP application 30 seconds after 
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the mp3 application is started and we let it run for 50 
seconds. In the second experiment the multimedia support 
is not used. As expected, the audio quality degrades 
considerably as soon as the FTP application is started. The 
trace for this experiment is shown in figure 5. In the third 
experiment, the audio quality does not degrade when the 
FTP application is started. As it can be seen in figure 6 , the 
FTP application can only use the bandwidth that is not 
required by the audio application. Based on these results we 
can demonstrate that our algorithm can make the necessary 
bandwidth provisions for multimedia support. 


V. LESSONS LEARNED 


This prototyping effort has provided useful insight to a real 
implementation. Our prototype can demonstrate the correct 
execution of the protocol but we note that the polling 
mechanism imposes considerable load on the CPU. This 
load can be alleviated by migrating the polling mechanism 
into the firmware so it can run on the MAC embedded 
processor rather than the host system CPU. Such migration 
need only take place after a protocol has been verified on 
the existing prototype. The development system we have 
constructed was very useful coding different versions of the 
protocol in a short period of time. The most valuable part of 
our implementation, however was the monitor node. This 
proved to be _ indispensable during debugging and a 
practical tool for collecting traffic traces during the 
performance evaluation phase. 


VI. CONCLUSIONS 


We have described a basic point-to-point link algorithm for 
multimedia support and demonstrated it using prototype 
testbed. This prototype enables the use of widely used 
applications for testing the point-to-point protocol. It also 
provides the basis for a more elaborate system on which 
more optimizations and better performance can be achieved 
for multimedia support as well as other specialized MAC 
protocols. 
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Figure 5 Mixed traffic without multimedia 
support 
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Abstract - The implementation of wireless communications 
inside office buildings requires the profound knowledge of 
signal penetration within buildings. This paper contributes to 
the modeling of propagation losses and measurement 
techniques in modern office buildings by presenting 
measurement results in 2.4 GHz. The measurements were 
conducted within National Technical University premises in 
Athens, Greece. The purpose of the measurement campaign 
was to study the radiocoverage of the three - floor story 
building of Electrical Engineers. The measurement procedure 
involved field measurements of a spread spectrum system. The 
measurement system and experimental procedures are 
described and analysis results are presented. Specifically, path 
loss is presented as a function of true and covered distance as 
well as comparative plots for the corridors and rooms of the 
three floors of the building. Path loss contour plots are also 
being presented. 


I. INTRODUCTION 


Mobile and personal communications have brought a 
revolution in telecommunication services and have changed 
the way that people use them. In addition, they have become 
the main strength which is leading the development of 
telecommunication systems and services. 

Indoor wireless communication encounters great difficulties. 
The objects (structure, equipment and people) surrounding 
the transmitter and the receiver, cause the three problems of 
direct propagation: reflection, diffraction and scattering. The 
performance of wireless communication systems inside 
buildings is bounded by these problems. Therefore, a 
prerequisite to the design of such systems is the knowledge 
of radio propagation characteristics inside buildings. 

This paper focuses on a spread spectrum system for which 
both transmitter and receiver are located inside a building. 
Spread spectrum technics were firstly used as an answer to 
the special needs of military communications and _ this 
application still remains the predominant. However, there is 
a number of non military applications under consideration 
which will soon invade in personal life. 


Il. EXPERIMENTAL CONFIGURATION 
1. Instrumentation 
1.1. Signal Source 


The transmitter hardware shown in Figure 1 consists of a 
Spread Spectrum Modem and a RF unit with maximum output 
power 26 dBm operating in the ISM band. The spread spectrum 
modem operates on a constant rate of 2 Mbps using Differential 
Quadrature Phase-Shift Keying Modulation (DQPSK). The RF 
unit transmits at center frequency of 2.43 GHz and receives at 
center frequency of 2.472 GHz using a bandwidth of 20 MHz. 
The 0 dBi gain omnidirectional antenna is mounted on mast of 
2.5 m height. 
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Figure 2: Mobile Unit Set-Up 


aac ee eee 
326 10” Mediterranean Electrotechnical Conference, MEleCon 2000, Vol. I 


0-7803-6290-X/00/$10.00 ©2000 IEEE 


1.2. Measuring receiver 


The receiver hardware shown in Figure 2 is located on a 
specially designed movable measuring unit (cart) suitable for 
indoor measurements. It consists of an omnidirectional 
antenna (0 dBi gain), a Low Noise Amplifier (LNA) with 30 
dB gain, a Spectrum Analyzer receiver, a GPIB card and a 
Laptop PC for data recording. The total received power 
within the 20MHz bandwidth of the spread spectrum signal 
is measured, while the receiver antenna height is 1.5 m 
above floor level. 


2. Measurement environment 


The measurement campaign was held within National 
Technical University premises in Athens, Greece. The 
measurements were conducted at the three - floor story 
building of Electrical Engineers. A ground plan of the 
measurement environment is shown in Figure 3. The 
transmitter was located at corridor 1 of the third floor. 
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Figure 3: Ground plan of second and third floor 
respectively 


Each floor consists of offices of 35 m? - 70 m’ placed at the 
both sides of two main corridors. The offices are separated 
by soft partitions, and the furnishing includes wooden desks, 
bookshelves, computer terminals and peripherals. The inner 
side of the offices consists of concrete wall with one or two 
wooden doors for each office. The soft partitions between the 
offices are made of plasterboard or glass. The biggest outer 
side of the building is covered with windows placed in 
aluminum frame separated by concrete posts. The other 
outer side of the building consists of concrete walls and two 
big windows at the end of each corridor. 
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Between the two main corridors there are small rooms where 
measurements are not possible to be held, as well as, two 
areaways whose sides are build of bricks made of glass. 

During the measurements the number of people in corridors 
varied from 2 to 10 persons. The measurement campaign took 
place during the working hours, while the personnel were 
present leading to the worst case of propagation. 


3. Measurement Technique 


The measurement campaign involves measurements of the 
signal strength along the corridors and a set of measurements 
inside the offices of the three floors of the building. 

The measurements along the corridors were static every 2/4 i.e. 
3 cm, in order to observe the fast fading. The starting point in 
every corridor was the edge closer to the transmitter. For the 
computation of the mean signal strength inside the offices a set 
of measurements took place along the perimeter of a square in 
the center of every office or where the arrangement of the 
furnishing made it possible. Moreover, in each case the total 
number of the measurements conducted was proportional to the 
size of the room [2]. 


Ill. EXPERIMENTAL RESULTS AND ANALYSIS 
1. Measurement results 


For the processing of the data stored in the PC the calculation of 
the path loss at every measurement point was necessary. 

Typical plots of the path loss as a function of true and covered 
distance are presented below (Figure 4 and Figure 5), as well as 
comparative plots for the corridors (Figure 6 and Figure 7) and 
rooms (Figure 8) of the three floors of the building. Comparative 
plots are composed of the log-normal curve which best fits the 
corresponding path loss curve. Path loss contour plots for the 
third and the second floor are also being presented (Figure 9). 


Path Loss (dB) 
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Figure 4: Path Loss as a function of true distance for corridor 1 
of the third floor (LOS). 
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Figere 5: Path Loss as a function of covered distance for Figure 8: Comparison between rooms - path loss for third and 
corridor 2 for second floor (NLOS). second floor. 
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Figure 6: Comparison between corridor I - path loss for 
each floor 
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Figure 9: Path Joss contour plots for the third and the second 


floor. 
| 2. Experimental analysis 
ee ee From the data processing and the plots given above, the 
: ts following results are derived: 
Figure 7: Comparison between corridor 2 - loss ¢ The signal strength along the corridor 2 of the first floor was 
third and second floor — oa too weak for the corresponding data to be taken into account. 


e A raise of the signal strength is noticed at every corridor, 
e.g Figure 5, except corridor 1 of the third floor, at the 
distance of 2.5 m - 4.5 m from the beginning of each 
corridor. This could be attributed to the existence of an 
areaway build of bricks made of glass located at the above 
distance (Figure 3). 
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e Along corridor 2 of the second floor the signal is stronger 
than that of corridor 1, which is located exactly below the 
transmitter. This could be explained by the existence of 
the areaway mentioned above since only bricks of glass 
are posed between the transmitter and the receiver for 
certain points of corridor 2 (2.5 m - 4.5 m). 


Using the corridor data penetration loss between floors and 
corridors is been calculated: 


Penetration Losses Between Floors 

e Penetration loss between the third and the second floor is 
38.6 dB for corridor 1 and 14.3 dB for corridor 2. This 
difference is caused by the existence of the areaway 
mentioned above. 

e Penetration loss for corridor 1 between the second and 
the first floor is 10.4 dB. 


Losses Between Corridors 

e Losses between corridor | and corridor 2 are 23.6 dB for 
the third floor and -0.7 dB for the second floor. It should 
be noticed that the signal along corridor 2 is stronger than 
that along corridor 1 (for the second floor). 

e What is more losses are reduced by 1 or 2 dB at the points 
where there are no walls between the corridors, while at 
the points where there are walls built of bricks made of 
glass (areaway) losses are reduced by 4 or 5 dB. 


Penetration losses between floors and corridors are presented 
in Table I and II respectively. The log-normal curve 
equations of the corridors for each floor as a function of true 
and covered distance are gathered in Table III. 


The following conclusions can be derived for the path loss 
exponent (n) from the log-normal curve equations 
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Table I: Penetration Loss between floors 
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Table II: Conclusions for the path loss exponent (n) for Corridor 1 and Corridor 2, for 1", 


e For corridor 1 of the third floor n=1.9 which approximates 
the free space path loss exponent (n=2}. Being smaller than 
n=2 implies waveguide phenomenon. 

e For corridor 2 of the third and the second floor n=3.9 and 
n=4.2 respectively which approximate the plane earth path 
loss exponent (n=4). 

e For corridor 1 of the first floor n=6.3 which is due to 
multifloor propagation. 


III. CONCLUSIONS 


Propagation measurements results at designated ISM band 
for a Spread Spectrum system were examined. From the field 
trials propagation experiments described in this paper, indoor 
radio channel characteristics have been derived. The spatial 
fluctuations of received signal envelope were recorded. Path 
loss models have been derived in both LOS and NLOS cases. 
A synopsis of the best fitting parameters is given and 
quantitative models are proposed. The standard deviation of the 


- fading signal correlates with the amount of people activity in 


the vicinity of the radio wave propagation. 
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| Total | 23.6 | -0.66 
[Areaway | _19.6 | -5.9 


Table Il: Loss between corridors 
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Ginis Web — The Tool for GIS Applications 
Development Based on Web 


Ana Stojanovic, Leonid Stoimenov, [EEE Student Member, Aleksandar Dimitrijevic, Slobodanka 
Djordjevic-Kajan, IEEE Member 


Abstract- Internet is overwhelmed by large variety of GIS 
applications. Very often it is so hard to find information you 
need. There are all kinds of spatial information systems, ones 
with large raster maps and sufficient information that need 
time to load. On other side there are fast systems using vector 
data that can show a world to you in just a second, but not so 
sufficiently. When you want to put your information system on 
the Web you have to choose between one of these two ways of 
presenting GIS information. We have chosen one. These 
papers present a simple way to organize your GIS data on the 
Web. Actually, this is just a prototype. 


Index Terms—GIS, Web, spatial database 


I. INTRODUCTION 


There are two basic ways to deploy GIS applications on 

Web. They can be deployed as server-side or client-side. 

First ones require less software on client-side but their 

performance is poorer and user has almost no freedom in 

querying. Also GUI is very primitive and information has to 

be re-transferred for each request from server to client side. 

Then, it is possible to implement information system in 

many different ways no matter if they are client or server 

based. Some well-known ways are by using: 

e different kinds of agents [4], 

e earlier defined user profiles [5], 

e static HTML pages, which are updated earlier [6], 

e several server applications connected into one system. 

First two approaches are client oriented the others are 

server oriented. Troubles appear while implementing either 

of them. 

These problems include adherence to standards and 

performance disadvantages [1]. At client-side systems 

adherence to standards problems are usually that 

e In order to load an application (in our case Java applet) 
need to buy additional software, 

e  Platform/browser incompatibility may appear. 

Also there are problems concerning performance, meaning 

that: 

e Initial download times can be substantial if database is 
transferred, 

e Users must wait for software to download 

e Overall performance can be low with large databases 
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Problems that appear when using server-side approach may 

seem smaller since these applications are much faster, but: 

e functionality is reduced to one click functionality for 
Web clients 

® quality of graphics is very low 

e GULis primitive. 

We deployed our software as client-side application. It’s 

disadvantages seemed to be minor the time we started to 

deploy our prototype. 


II. OUR SOLUTION 


There is a long way from an idea to just one simple GIS 
prototype. Even though it is simple its deploying follows 
the logical steps found in its many predecessor systems [2]. 


Figure 1. GIS Workflow 
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This system uses several ways to collect data. Since files 
are used as underlying layer for an applet these files can be 
formed in three different ways manually using simple text 
editor. By the way, this can be very hard as records are 
formatted. The other way is by using simple applications 
StreetManager and ObjectManager, which automatically 
access these files and store records in predicted format. 
Still, this means that someone has to type in all required 


- data. Third way, the most proficient one, is to convert this 


data from already existing databases into formatted files. 
Maintaining and editing data is done using already 
mentioned applications or through DBMS and forming new 
text files. 

Finally, presenting data is very simple. Images, files and 
Java applet are wrapped together using just one small 
HTML file. This file contains information about folders 
GIS data is stored at, about amount of data needed for 
initialization and provides simple interface. 

For-the prototype images used are in JPEG and GIF format, 
since these formats are standard for Web. Applet code is in 
Java and can be interpreted both on Windows 3.x/NT and 
UNIX systems by Internet Explorer 4.0 or Netscape 
Navigator 4.5. 


A. System Architecture 


As mentioned before, this is a client-side GIS application. 
All required data is loaded from the server-side at the 
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moment client requests certain HTML page. Since different 
data including images and text files is needed for applet to 
initialize, it is recommended for the size of data packages to 
be small. That way time needed for the initialization will 
reduce. 


Figure 2. Client-side architecture 
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As it is shown on Figure 2 all work is done on client side 
once applet is loaded. Applet gets necessary data from files 
stored in local cache and refers to server only when 
requested data does not exist there. This way user does not 
have to wait after every query, like with server-side 
applications. As size of packages needed to transfer from 
server to client side decrease user action response time 
reduces. Communication between server and client side is 
direct from applet. Applet gets data from server location 
quoted in HTML file, under <APPLET> tag using Java 
Applet method getImage. 

Applet refers to server only at the moments requested data 
can not be found in local cache. This happens when user 
makes great movements while scrolling the city plan. 


B. Applet 
This is what appears in the browser window every time user 


loads page http://gislab.elfak.ni.ac.yu/PlanGrada. 


Figure 3. Applet appearance 


Opening window shows raster image supplemented with 
some vector data. This data consists of hyperlinks and text 
information concerning important city objects and streets. 
User has ability to move through city plan continually using 
scroll bars or jump from site to site selecting items from 
drop-down lists. As it can be seen user is given freedom to 
choose type of objects to be highlighted, then to choose 
between objects of the same kind to be shown or to choose 
a street from the street database. Every one of these actions 
causes a change of background raster image and vectored 
object data in local cache. 


Ill. BASIC PRINCIPLES OF IMPLEMENTATION 


Power of this software lies in its simplicity. Since HTML 
file is underlying, by just changing parameters and paths in 
APPLET tag of HTML code or images and files on server 
side we can use same software for completely different 
cities. 

Background for vector data is raster city map. The idea is to 
split that image into small pieces, so that their transport 
over the Web would not influence much on time needed to 
load the page. In the prototype map is split into 80 pieces. 
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At the initialization point only parts of marked images are 
visible so those images are the only ones which are loaded. 
When a user, while scrolling a map, makes a small 
movement, some parts of visible images stay visible while 
the others become invisible and are replaced by parts of 
other images (Figure 5). This is one of the moments when 
applet starts communication with server, requesting the 
images, whose parts became visible but are not stored in 
local cache. The other situation is when user makes a big 
scroll or jumps using drop-down lists. Then complete 
background is changed. Image vector gets new data and 
response time grows. After new images are loaded vectored 
data gets changed. So if user does not start listing data 
before images are loaded, no slowing down will be noticed. 


Figure 5 Image field imgs{][] 
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As it is shown on the figure above, screen can overlap at 
least four (red rectangle) and nine images at most (blue 


=a. gp eee 
332 10” Mediterranean Electrotechnical Conference, MEleCon 2000, Vol. I 


rectangle). In both cases there is a room to scroll without 
loading new images to imgs[][] field. When scrolling 
overlapped images appear, they are not loaded again, 
instead they are moved to convenient position in image 
field. There is corresponding text file city_name.objects 
containing data about important city buildings and 
institutions shown on every particular image. So, after 
image is loaded, corresponding data file is loaded using the 
same index values. 

Objects are classified by types (17 of them) including 
stations, schools, faculties, cultural institutions, markets, 
city and government institutions, sport objects and so on... 
When user moves above the object on the map hand cursor 
is shown and status line shows information about homepage 
for the object. 

Streets are stored in different file and not directly connected 
to images as objects are. At initialization point all streets 
are numbered in drop-down list: Street list. When user 
chooses one street from the list, absolute geometric 
coordinates of the street beginning and street end (in pixels, 
off course) are read from the file city_name.streets. These 
values are used to count indexes of visible images in city 
map. 


IV. SUPPORTING APPLICATIONS 


The simplest way to gather data, necessary for applet to 
function correctly, is by using supporting applications 
StreetManager and ObjectManager. These applications 
have simple and intuitive GUI and it is very easy to use 
them. For objects, user types in names and links and 
coordinates of upper left and bottom right corner are taken 
after mousePressed and mouseReleased_ events, 
successively. 


Figure 6. Application ObjectManager 


Figure 7. Application StreetManager 
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V. CONCLUSION 


It is very hard to decide how to present your data on Web in 
order to make it interesting and informative for viewers and 
shorten download times at the same time. People are not 
patient enough with Web, because it costs money; so 
whenever you want to attract people you have to 
compromise between amount of information and time 
needed to present it. We hope that our software does that. 


VI. FURTHER PLANS 


It would be convenient to try replacing files with 
underlying database and use JNI and RMI in order to give 
more functionality and information to existing system. It 
could happen that this solution slow data loading since then 
communication between server and client side would be 
more often. Anyway, one of the most important things to do 
is to reduce size of raster images, simply by changing them 
to less colorful ones and with less image information. The 
main idea is to replace them with completely vectored data 
and to draw objects in applet using already existing files, 
which are now used for navigation only. 
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Integration of GIS Data Sources over the 
Internet Using Mediator and Wrapper 
Technology 
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Abstract--This paper presents ORHIDEA project which 
provides intelligent integration of information using mediation 
technology. ORHIDEA is used for integration of spatial data 
from GIS databases, alphanumerical data from RDBMS and 
other data sources. 


Index terms --mediator, information integrati on, GIS legacy 
databases 


I. INTRODUCTION 


Geographic information systems are computerized systems 
for managing data about spatially referenced objects. GIS 
differ from other types of information systems in that they 
manage huge quantities of data, require complex concepts 
to describe the geometry of objects and specify complex 
topological relationships between them [1]. In addition, GIS 
data is typically used by various groups of users with 
different views and needs. 

New generations of applications, such as GIS, have much 
more demands in comparison to possibilities, which could 
provide legacy information systems and traditional database 
technology. Most current geographic information systems 
assume and present the static world. However, information 
that exists in different spatial database may be useful for 
many other GIS applications. Also, numerous legacy 
systems should be coupled with GIS systems, which present 
additional difficulties in developing end-user applications. 
Because of that, there is a need to provide communication 
and collaboration between these applications. Most of large 
organizations have huge amount of data collected and have 
invested in many legacy information systems. Nowadays, 
new applications, such as GIS, can be created combining 
existing information in a wide information system. 

In traditional collaborative engineering, information is 
managed in an ad hoc manner, with data maintained in 
application specific files and legacy databases. Data 
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exchange between these systems usually performed by 
executing scripts and programs that transport data. This 
approach may work with small projects, but it is inadequate 
for distributed environments. 

Today, the only way to integrate huge amount of available 
data is to build specialized applications based on mediators 
[5]. Information mediators provide an intermediate layer 
between information sources and users. Nowadays, there is 
a strong trend of information systems integration in chain of 
systems among public information structures such as 
Internet. However, no one wants to share his own 
information with public. This is the reason that integration 
is interchanging data among the applications without 
centralized control. Another trend in GIS is publishing 
maps for World Wide Web community and development of 
web-based GIS applications. 

The goals of our research activities, described in this paper, 
are defining an architecture for integration of distributed 
and heterogeneous GIS data sources, identifying the need 
for mediators and wrappers in telecommunication network 
administration, and adding the integration technology to the 
GinisNT [2,3] spatial database. We examine a research 
whose final goal is to make disparate data sources work 
together. This work is known as information integration. 


II. INFORMATION INTEGRATION 


Data integration application begins from a set of pre- 
existing data sources. These sources might be database 
systems but more often are unconventional data sources, 
such as legacy systems, structured files, e-mail files, and 
www pages [8,9]. Most importantly, a data integration 
system lets users focus on specifying what they want rather 
than thinking about how to obtain the answers. As a result, 
it frees them of combining data from multiple sources, 
interacting with each source and finding the relevant 
sources. A data integration system provides a uniform 
interface to a multitude of data sources. 


A middleware system, based on mediator and wrapper 
technologies, separates the storage of data management 
facilities, providing common interface for application. 
Information mediators were originally developed for 
integrating information in databases [4,5]. Three-layer 


a a aa TS | TERE aE TS UNS SET ae ee 
334 10° Mediterranean Electrotechnical Conference, MEleCon 2000, Vol. I 


0-7803-6290-X/00/$10.00 ©2000 IEEE 


mediator architecture provides transparent view of data 
sources and independence of data sources and user 
applications. 


Applying the mediator framework to the Intranet/Internet 
environment solves the difficult problem of gaining access 
to real world data sources. Internet provides the underlying 
communication layer and protocols for mediation of 
distributed systems. A wrapper is a program that is specific 
to every data source [6,7]. Wrapper extracts a set of tuples 
from source file and performs translation in the data format. 


Mediation concept is a part of the ARPA [3 (Intelligent 
Information Integration) reference architecture. The [3 
reference architecture should be seen as a vision of how 
vast amount of heterogeneous information can be 
incrementally pulled into a gigantic, reusable library of 
information resources [6]. 


In order to simplify network programming and make 
component based software architecture real some 
middleware technologies are developed on paradigm 
programming level. Two, already existing, distributed 
object models tend to be standardized. These are DCOM 
[10] and CORBA [11]. These high-level programming 
paradigms can be used to support deploying software 
components, platform- independent, which co-operate using 
TCP/IP network. These technologies are relatively new and 
there is so little experience in combining them in order to 
support huge information integration systems used in 
enterprise organization. 


UI. ORHIDEA PROJECT 


The research group at the Computer Graphics and GIS Lab 
at the University of Nis, Yugoslavia has been developing 
GIS software for eight years now. GinisNT is a scalable, 
OO environment for the development of GIS applications 
that is built on top of a relational DBMS the usage of which 
is made transparent to the user completely [12,13]. The 
goals of our research activities are defining an open 
architecture, identifying the need for mediators in 
telecommunication” network management, adding the 
integration techniques to the GinisNT spatial database and 
realize the active mediator level which perform integration 
tasks. We have developed Internet based version of 
GinisNT, called ORHIDEA. Projects goal in the ORHIDEA 
is to make it simple for users the use of different data 
sources in their GIS applications ORHIDEA is a 
middleware, mediator system that provides data interchange 
and access to distributed data sources without changing 
how or where data is stored. 


Our primary goal with ORHIDEA project was to design a 
platform and methodology that could serve for information 
exchange over the Internet between distributed and web- 
based GIS sources. However, the web’s browsing paradigm 
does not readily support retrieving and integrating data 
from multiple sites. The only way to integrate the huge 
amount of available data is to build specialized platforms. 


The ORHIDEA platform uses agent-wrapper and mediator 
technology to allow communications between GIS 
applications over the Internet/Intranet. Wrappers are types 
of software “glue-ware” that are used to attach other 
software components. Mediators are “complex”, higher 
level component, that combine data from different data 
sources. The mediator and agent-wrapper approach also 
allows maintaining these applications (data sources) and 
incorporate new sources, as they became available. 


The general approach to mediation in ORHIDEA is based 
on our previous work on the GinisNT architecture with 
GinisNT Active Mediator layer. The GinisNT Active 
Mediator layer provides an intermediate level between 
information sources and user GIS applications. 


Mediators 


Translator i 


Translator | | Translator 


eC 


GinisWeb Server 


Figure 1. ORHIDEA platform 


The architecture of this system is shown in Figure 1. 
ORHIDEA is set of distributed software components, which 
provide GIS information integration. System provides 
"virtual" database schema, and user application are 
operative system independent. Client application can be 
placed on Web or stored on any workstation. Data sources, 
which should be recognized by our system are spatial 
GinisNT database (and any other commercial spatial 
database), ActiveMediator and GinisWeb interface for 
GinisNT, relational databases and other legacy information 
resources. For every data source in heterogeneous 
information system there is a translator (or wrapper) which 
converts logically base data objects to common information 
model. The set of information agent/wrappers implement 
the middle-layer linking services. This layer perform 
mediator functions which include transformation of data 
and mapping between data models. In order to make this 
logical translation, translator converts query’s to requests 
through information from common model. These requests 
can be executed by data source. Translator also converts 
data returned from data source to common model. 


Mediators are above translators. Mediator is a system, 
which filters information from one or more data sources. 
Mediator builds in knowledge, required for processing 
specific information. Mediators should hide existence of 
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various (heterogeneous) data sources; they should abstract, 
collect data for users and translate data and schemas. 


Important system features are that there is no global 
database schema and that mediators can work 
independently. For mediator design it is necessary to know 
only which data sources it will access and format of data 
returned by source. Complete understanding of used data 
source features is not necessary. 


TV. CONCLUSION 


This paper focuses on the components of the ORHIDEA 
platform that are needed to support the mediating functions 
in this architecture over the Internet/Intranet. In this 
architecture we are used new technologies, such as 
mediators, wrappers, and agents. 


Project ORHIDEA, described here, uses _ intelligent 
information integration using mediators as link between 
data source and user applications. System should provide 
actualization of client/server applications, using Internet 
and Web technologies as under-layer for network service 
and integration of distributed data sources. Mediators 
provide easy information access and no need for creating 
new database exists, "virtual" database schema, which gives 
user data stored in existing databases is created. Mediators 
are efficient and cheap way of integrating data from 
heterogeneous information systems. Logical consequence is 
that mediators provide evolving system development and 
use existing software investments in software and 
databases. 
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Effective ways for querying images by content 
over the Internet 


M. Vlachos', G. Vardangalos’, C. Tatsiopoulos’ 


Abstract 
Increasing amounts of images have made explicit the need 
for more effective and efficient image indexing and search of 
not only the metadata associated with an image but also 
retrieval directly on the content of the image. The scope of the 

COBWEB project (Context-Based Image Retrieval on the 

WEB - EP28773) is to create a pioneer system that will provide 

a solution to the problem of automatically storing and 

retrieving images from large image databases dispersed over 

the Internet. The project focuses on the following issues: 

e Development of effective algorithms for the automatic 
storing of the visual features of the images that can be 
used as a means for retrieving similar images. 

e Creation of a simple, cross-platform user interface that 
helps the user to query the database in the most natural 
way. 

e Development of a robust communication subsystem that 
guarantees satisfying retrieval times. 

In this paper, we will describe the architecture and 
implementation details of. the Human Computer Interface 
(HCI) and the Intelligent Search Component (ISC), which are 
two of the main components of the COBWEB system. 
The HCI consists of a graphical interface, which enables the 
user to interact with the system. It provides the user with 
various approaches in querying the database including 
keywords specification and sample image submission. The ISC 
main tasks are to handle and translate the query criteria 
submitted by the user through the HCI, navigate the database 
and find images that satisfy those criteria and return the 
results to the HCI. 
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Index Terms 

Content-based image retrieval, image database, keywords, 
visual features, algorithms, cross-platform, Java, Internet, 
feature extaction, query, HCI, ISC, FEC 


I. INTRODUCTION 


Increasing amounts of images have made explicit the 
need for more effective and efficient image indexing and 
search of not only the metadata associated with an image 
(e.g. annotations, captions, keywords) but also retrieval 
directly on the content of the image (visual features). 

The scope of the COBWEB project (Context-Based 
Image Retrieval on the WEB - EP28773) is to create a 
pioneer system that will provide a solution to the problem 
of automatically storing and retrieving images from large 
image databases dispersed over the Internet. The project 
focuses on the following issues: 

e Development of effective algorithms for the automatic 
storing of the visual features of the images that can be 
used as a means for retrieving similar images. 

e Creation of a simple, cross-platform user interface that 
helps the user perform queries and retrieve images in 
the most natural way. 

e Development of a robust communication subsystem 
that guarantees satisfying retrieval times. 


In this paper, we will describe the architecture and 
implementation details of the Human Computer Interface 
(HCI) and the Intelligent Search Component (ISC), which 
are two of the main components of the COBWEB system. 


II. OVERALL SYSTEM ARCHITECTURE 


Figure 1 shows a simplified diagram of the overall 
COBWEB system structure. It presents in a simplified 
manner the physical architecture of the approach and 
clarifies how functionality is distributed across the different 
elements of the system. Furthermore, it shows how those 
elements communicate with each other. 
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Output 
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COBWEB 
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Figure 1 


COBWEB is implemented as an Internet / Intranet- 
based application consisting of three logical tiers. The user 
does not interact directly with the image database but 
through a_ user-friendly, graphical interface that 
communicates with a search engine. The search engine’s 
intelligence lies in its ability to navigate the image database 
and find images that can possibly correspond to the user’s 
demands. The thumbnails of those images (and full-sized 
previews) are available to the user interface for 
downloading through the COBWEB’s web server. Thus, the 
search engine functions as an intermediate tier between the 
user interface and the image database. 

Figure 2 explains the purpose of each functional 
component of COBWEB in the context of the system’s 
general architecture. Interactions between the components 
of the system are also shown. 

The Human Computer Interface (HCD) is the user’s 
view of the COBWEB system. The user submits keywords, 
sample images or sketches as criteria for his queries. The 
above criteria are sent directly to the Intelligent Search 
Component (ISC) i.e. the search engine. The ISC uses the 
algorithms provided by the Feature Extraction 
Component (FEC) to “translate” those sketches or images 
into meaningful (for the ISC) feature vectors. Then, the ISC 
searches into the database (Database Component -DC) for 
images that satisfy the criteria using the algorithms 
provided by the Feature Comparison Component (FCC). 
Specifically, the ISC compares the keywords and the visual 
feature vector of the sample image with the respective 
keywords and feature vectors of the database’s images. 
After the interaction with the DC has finished, the ISC 
retums to the HCI a ranked list of image IDs. Finally, the 
HCI fetches those images’ thumbnails (or larger, full-size 
preview images) from the web server and presents them to 
the user. 


Human 
Computer 
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ae Feature 
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Component 


Intelligent 
Search 
Component 


Feature 
Comparison 
Component 


Database Component 


Figure 2 


At the following paragraphs we will present the architecture 
and implementation details of two of the main components 
of the COBWEB system, the Human Computer Interface 
(HCI) and the Intelligent Search Component (ISC). 


II]. HUMAN COMPUTER INTERFACE 


A. Overview 


The HCI consists of a graphical interface, which enables 
the user to interact with the database. It is targeted to 
inexperienced people who do not have extensive knowledge 
about matters such as computers or image processing. Thus, 
it provides an intermediate solution between effectiveness 
and simplicity. At later phases of the software 
implementation the HCI will enable experienced users to 
choose to follow more sophisticated and accurate query 
procedures. 


B. Function / Processing 


The HCI provides the user with various approaches in 
querying the database including: specification of keywords, 
submission of sample images (possibly loaded from the 
local file system), rough sketching-of the image’s general 
view (with the use a simple, embedded drawing utility) and 
a query refinement mechanism, based on the results of the 
previous retrieval step. 

The query procedure consists of the following steps: 


1) Submission of query criteria to the ISC — The HCI 
transmits through the network to the ISC the keywords 
and the sample image the user has submitted. 
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2) Retrieval of images and presentation of results to the 
user - The HCI receives the IDs of the matching 
images from the ISC. Then, it requests the resulting 
image thumbnails from the web server and presents 
them to the user. Obviously, the best matches are 
presented first. 


After that first retrieval step, the user can do one of the 
following: 


@ Query refinement 
The user navigates the results from the previous retrieval 
step, chooses the images that match best to the image he is 
searching for and asks for a query refinement. 

e Alteration of a previously retrieved image 
The user selects an image from the results, uses the 
embedded drawing utility to alter it and make it resemble to 
the image he is looking for and re-queries using it as a new 
sample image. 


C. Implementation 


A basic requirement for the HCI was to be developped as 
a cross-platform application. Therefore, it has been 
implemented as an independent Java application 
conforming to Sun’s Java 2 standards. 


D. Future work 


One of COBWEB’s main targets is to implement the HCI 
as a downloadable Java applet. For the time being, the 
above solution is not feasible because of the following 
reasons: 

e Currently, browsers do not support the Java 2 
platform. 

e The embedded drawing application makes the 
code quite lengthy for downloading. 

e Java 2 execution times are long, especially when 
dealing with complicated user intefaces. 


IV. INTELLIGENT SEARCH COMPONENT 


A. Overview 


The Intelligent Search Component (ISC) is the search 
engine of the COBWEB system and is responsible for the 
execution of the queries to the image database. The retrieval 
process is triggered by the query request, submitted through 
the HCI. Functionally, the ISC serves as an intermediate tier 
between the HCI client and the database. 

ISCs power lies on the use of effective and efficient 
mahematical algorithms, capable of extracting a number of 
well-defined features (visual features) from images of 
arbitrary size. A brief description of the nature of the 
algorithms integrated in the first prototype of the COBWEB 
system follows: 


e Global color histogram 
e Local color layout 


e Coarse level color distribution of homogeneous 
regions 
e Large scale color texture 


Detailed description of the algorithms mentioned above is 
beyond the scope of this paper. For more information see 
references 3,4,5 and 6. 


B. Function / Processing 


The ISC receives the user’s query criteria from the HCI 
and navigates the database to retrieve the images that satisfy 
the user’s needs. A detailed description of the query 
procedure follows: 


e Query reception from the HCI 

The ISC receives query criteria from the HCI. As 
mentioned before the query criteria consist of a sample 
image and a set of keywords. 


e Extraction of visual features of sample image 

The ISC uses the algorithms provided by the Feature 
Extraction Component (FEC) to create a feature vector 
containing visual information about the sample image. 


e Database navigation 

Initially, the ISC searches the database to find a subset of 
images whose keywords match the ones selected by the 
user. After the selection has been done, the ISC retrieves 
from the database the feature vectors of the images of that 
subset. Then, the feature vector of the sample image is 
compared with the respective one of each image of that 
subset. The above goal is accomplished with the help of the 
similarity algorithms contained in the Feature Comparison 
Component (FCC). 


e Score sorting 
The ISC sorts the similarity scores the FCC created for each 
target image of the subset. 


e Results aggregation 

The ISC selects the images with the best scores and returns 
the results to the HCI, in the form of image IDs. The 
maximum number of returned images is subject to the end — 
user’s preferences. 


Figure 3 is a visual representation of the procedure 
described before: 
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C. Implementation 


The ISC is implemented with the help of the Microsoft’s 
Visual J++ environment. Microsoft’s solution has been 
adopted instead of Sun’s pure Java solution (Java 2) in 
order to ensure the communication with the FEC and FCC 
components, which follow the standards of Microsoft’s 
Component Object Model (COM). 


D. Future work 


Obviously, feature extraction and feature comparison 
procedures are time-consuming procedures and the ISC is 
considered a time-critical application. Moreover, Java is not 
considered the best platform for the execution of 
computational tasks. Therefore, it has been decided that the 
next version of the ISC engine will be implemented with 
the help of a more powerful environment / language such as 
Microsoft’s Visual C++. 
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Global Music Creation Over The Internet 


Marko Privosnik, Student Member, IEEE 


Abstract--The article presents a system under development 
that enables globally distributed interactive music creation, 
manipulation and presentation, all in real time. 

The system is based on the concept of music stream, which is a 
sequence of building blocks that make up music. The 
client-server architecture of the system allows Internet users 
to download and to listen to the stream. On the other hand, it 
allows each user or software agent to introduce new musical 
building blocks (audio samples, musical events, etc.) to an 
already existing music stream. The server merges these 
building blocks to a local music stream and distributes it to all 
of its clients and other connected music stream servers. The 
overall emergent result is a single and distributed, but not 
fully homogeneous music stream, slightly different for each 
participant due to distribution delays and local (spatial and 
temporal) merging solutions. 


Index Terms--Emergence, Internet Music, Multi Agent 
Systems 


I. INTRODUCTION 


Internet, as a global network interconnecting millions of 
people, allows all users to interact and exchange ideas in 
real time regardless of their location on the globe. The 
objective of our project is to develop a new paradigm of 
music creation that uses the possibilities of Internet and 
enables people to interactively create new musical worlds. 


A number of Internet applications have appeared over the 
last years that enable people to communicate, exchange 
ideas and co-operate with each other. Internet Relay Chat 
(IRC), for example, is used for chatting and discussing 
different topics, while Multi-user dungeons (MUDs) enable 
people to create and enter virtual worlds, where they co- 
operate with other users. Applications that are using a 
similar concept, but in contrast to exchanging textual 
information, allow people to exchange musical information 
also emerged, like Net Sound project [1]. Other appearing 
applications enable users to cooperate in an interactive and 
creative music environment over the network [2], [3], or 
make possible distributed multi agent systems to generate a 
musical composition [4]. 


In our project we are building a new and different Internet 
application based on the concept of music stream. The 
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application will enable every Internet user to participate in 
the process of creating new musical worlds and will enable 
people to exchange ideas and interact with each other in 
new and exciting ways. Other similar projects are 


II. Music STREAM 


The concept of music stream presents a new paradigm of 
how music is created and performed. It is a basis of a 
system that makes possible globally distributed interactive 
music creation, manipulation and reconstruction, all in real 
time. 

Music stream is a sequence of building blocks that make up 
music (figure 1). The building blocks can be very different 
in their role and complexity (i.e. sound definition, note 
definition, control message, synchronization message, 
coordination message, etc.). They are interpreted in 
succession according to their time on and validity 
parameters. The result of the interpretation has one or a 
series of effects at absolutely or relatively defined places in 
the represented music (i.e. in an infinite loop a sample is 
played periodically). The quality of the interpretation, 
which is done by a rendering engine, is depended on 
specific system’s capabilities. 
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Fig. 1. Music Stream as a sequence of building blocks 
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Fig. 2. The propagation of new building blocks over the system. Music stream characteristics (building blocks) are represented with 
different visual patterns in rectangles. Client's new building blocks are shown in its lower rectangle. In step | two clients (2 and 
4) uploaded new building block to their servers (B and C). In step 2 both servers simply integrated received blocks into local 
stream. In step 3 servers B and C tried to synchronized their streams and found different merging solutions. In step 4 both 


servers tried again to synchronize streams and succeeded. 


Til. THE GLOBAL ARCHITECTURE 


The presented system consists of a network of globally 
connected servers and their clients. Connected servers allow 
global distribution, consistence and persistence of a music 
stream. Clients make possible to download and/or to upload 
music stream to a server. On one hand, the client-server 
architecture of the system allows clients to receive music 
stream from a server. Received music stream can be then 
rendered in real time and listened. On the other hand, the 
architecture allows clients to introduce new musical 
building blocks to an already existing music stream. These 
blocks are packed as a music stream and transmitted to a 
server. 


A music stream server can receive new building blocks 
from its clients and/or from connected servers. The server 
then merges streams with new building blocks to its local 
music stream and distributes it again to all of its clients and 
connected music stream servers. Potentially, every client on 
every server Can participate in music stream creation, what 
makes merging process a complex task. If the received 


stream is not in conflict with the server's local music stream 
or other incoming streams, then it is simply integrated into 
the server’s local stream. Alternatively, if conflicts arise, 
they need to be solved by the server, which uses a 
probabilistic algorithm that tries to find the best 
compromise and to synchronize multiple received music 
streams with the server's local stream. We are 
experimenting with algorithms that support cooperation and 
competing of users for the influence over the global music 
stream development. Example in figure 2 shows the 
propagation of new building blocks over the system, where 
slightly different music streams need to be synchronized. 


The overall emergent result is a single and distributed, but 
not fully homogeneous music stream, slightly different for 
each participant due to distribution delays and local (spatial 
and temporal) merging solutions. But globally, music 
stream evolves as a single entity. The resulting, distributed, 


music is made up not only from temporal, but also from 
spatial harmonies. 
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IV. AGENT CLIENTS AND MANIPULATORS 


In addition to the human users, special music stream 
software agents can also manipulate and co-create global 
music stream. Music stream software agents are software 
entities that are logically equivalent to the human 
participants in the stream creation process. Typically they 
are utilizing some algorithmic composition technique. An 
agent can run as a complete music stream client or just as a 
bot or plug-in inside a more general programmable music 
stream client. Agents are an essential part of the system 
because they keep the music stream alive. Even if there is 
no user activity, agents will create the stream. At every 
moment, there should be at least one active agent present in 
the system. On the other hand, if there are human 
participants, agents compete and cooperate with humans 
and are equal with them. In this way, music becomes a 
living entity, independent, but still responsive to human 
stimuli. 


Instead of manually defining building blocks, human 
participants may interact with the global system in a more 
natural way by using manipulators. Manipulators are virtual 
or actual devices that translate user actions on them into 
music stream building blocks. They can be everything from 
keyboard, mouse and similar conventional devices (with 
corresponding software) to devices like beam controls and 
similar. More advanced manipulators may incorporate a 
feed back mechanism, that- translate music stream back to 
manipulator and have an effect on it, what would make 
possible users to physically interact by means of music 
stream. 


V.- CONCLUSION 


We believe that Internet is becoming an environment for 
global co-operation in many different areas of our every day 


lives. Internet is not affecting only the way we work, but 
also the way we spend our free time, communicate and 
socialize. The future Internet will transform the way we do 
creative work from a self-centered approach to a 
collaborative effort of many people, where the global result 
will emerge from real-time interactions and contributions of 
each individual. 


The presented system, which is our first step in this 
direction, is still in the early development phase. We are 
currently developing and improving software for music 
stream servers and clients. We plan to implement and test 
the first full version of the client and server part of our 
system in the next months. In the final stage of the project 
we will publish our application and system specification on 
the Internet. We believe that the final application will fulfill 
our expectations. 
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IP telephony from a user perspective 
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Abstract—This paper discusses issues of IP telephony. What 
can we expect from such communications methods, what kind 
of benefits does IP telephony bring, and what drawbacks can 
users expect in comparison to POTS telephony? Solutions for 
connecting VoIP terminals (ranging from PC phone to IP 
phone) and other functional entities of IP telephony, such as 
gateways and gatekeepers, are presented. Some IP telephony 
services are mentioned, and possible IP phone functions and 
applications, which any user would appreciate, are described. 
We conclude with an interesting questionnaire as a prediction 
of what users can expect from IP telephony. 


Index terms—Internet, Voice over IP, H.323, Services, 
Applications 


I. INTRODUCTION 


IP Telephony or VoIP (Voice over IP) enables users to 
transmit voice and fax or simply "phone calls" over data 
networks using the Internet Protocol (IP). IP telephony is 
sometimes confused with Internet telephony, but IP 
telephony is a much broader term, as it includes Intranet 
and Extranet communication. The transmission of voice 
over IP requires conversion of an analog voice into a digital 
stream of data packets. Packets are then routed through the 
data network from one user to another, where they are 
converted back into voice. This kind of transmission has 
certain advantages, such as rational use of existing 
networks, especially for corporations and companies with 
large volumes of international voice traffic. Home users can 
also benefit from making international VoIP calls at much 
lower rates than any other POTS (Plain Old Telephone 
Service) provider can deliver. Another benefit of IP 
telephony is the relatively easy establishment of new 
service providers, as they can use _ existing 
telecommunications networks. 


IP protocol was not designed to transfer voice, but it can 
allow transformation of almost any kind of data. The IP 
telephony world differs significantly from POTS, where the 


throughout the duration of a call, even when users don’t 
speak. In contrast, in IP packets travel from the point of 
origin to the destination along different routes and arrive 
with different delays. The way in which the voice will be 
converted into a data stream depends on the product used. 
The conversion can be done with or without compression, 
but only standardised protocols should be used. 


One major drawback in IP telephony today remains 
network availability. The Internet is a highly unstable 
network with many constantly breaking links and 
interrupted connections. Another thing users dislike is 
delays. Delays are rapidly increasing in line with traffic on 
the network. When someone uses email, web or file 
transfer, the delays are not so important, but in telephony 
delays exceeding 400 ms highly degrade the quality. It 
should be stressed that, bogged down with severe packet 
latency and several other problems, Internet Telephony 
today remains unacceptable for many, if not most, business 
applications. In contrast, Intranet Telephony doesn’t suffer 
from similar problems and is becoming acceptable in all 
environments, including business. Intranets experience 
more predictable levels of packet delay, have higher 
transmission rates and allow better user management 


For users, the IP telephone should be just another simple 
way to call another user on the other side of a ‘line’. 
Therefore, the IP telephone must at least provide services 
common to today’s normal PSTN or ISDN phones, while 
offering some sort of advantages, which appeal to users. 
This could mean additional services, new functions, easier 
use, and lower cost of services, e.g., transmission of voice 
and all supplementary services are obligatory in IP 
telephony. Because IP is a data network, some new 
services, such as a universal messaging (In, Out) box, 
integration of web browser, FTP client, online web-based 
shopping and others, can be introduced. In order to describe 
some of these services, we must provide some basic 
information about standards for interworking between 
packet and circuit switched networks. The description will 
be given in the next section. At this point we could ask why 


connection between two users is circuit switched anybody would switch form standard to IP telephony. There 
is always a minority of techno-enthusiasts always willing to 
sacrifice their time and money for new products available 
aie of Blbcerical En pisses ine on the market. ; For business however, a much more 
Universtiy oc avian important factor is the cost saving by using IP telephony. 
Tr2aSka 25, 1000 Ljubljana, Slovenia 
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II. VoIP SOLUTIONS 


The first IP telephony products were software programs 
which ran on PCs (Personal Computers). The connections 
were thus PC-to-PC type (dashed line in Figure 1). 


Side Buh ee 


Workstation 


Figure 1: PC to PC telephone call 


This configuration is the simplest, since the conversion 
from voice to data and back is performed on a PC using a 
sound card and appropriate software. As we can see in Fig. 
1, calls can only be made within the data network, or rather, 
within the Internet. Such a principle requires that both users 
are online during the conversation. This implies that users 
have to plan their calls in advance, or one of them must call 
the other over a standard phone in advance. Only then can 
they start a VoIP conversation. This makes a call very time- 
consuming especially when both users are connected to the 
Internet through a modem connection. 


The next step in VoIP evolution was a gateway. A gateway 
is a node, which interconnects data networks (LAN, 
Internet, WAN) and POTS networks. Some ISPs offered 
new services for IP telephony via their gateways. Now 
users could call other party from their PC to POTS 
telephone. An example is shown in Figure 2, where user A 
is calling user B to a standard analog phone. 


PC phone 


Side B () 


o) 
c— 
Vide 


ISP's VoIP 
gateway 


Figure 2: PC to POTS call 


As we can see in Fig. 2, the gateway performs the 
conversion from a packet data stream into an analog voice 
signal. Gateways were first introduced on the called side 
(side B). This enabled users to dial telephone numbers, 
which were in the gateway's domain. Program solutions like 


VocalTec, Net2Phone, MediaRing Talk and others are all 
capable of calling somebody outside the IP world. A call 
from user A travels through the Internet to the gateway 
nearest user B. This is important because user A pays for a 
call from the gateway to user B's destination, as well as the 
gateway services and his Internet connection. For example 
MediaRing offers a telephony service from Europe to any 
location in USA toll free. The authors did test this and 
established quite a good connection from Ljubljana 
University (PC phone with MediaRing software) to 
Princeton University (campus POTS telephone). Calls to 
other locations in the world are subject to payment 
according to the ValuePhone price-list. 


The next step in VoIP evolution is a gateway for bi- 
directional calls. With such a gateway, users can call each 
other via IP from analog telephones. If the ISP offers such a 
service, User A (Fig. 2) first dials his ISP’s service 
telephone number and then as an extension the telephone 
number of user B. Gateway A translates this number to the 
IP address of gateway B (the one nearest to user B) and 
forwards the call. The relations are shown in Figure 3. Such 
a service can be of great interest to users who make many 
international calls and who would like to cut down on 
expenses. The price of such a conversation can be very low, 
at least until ISP’s are not able to offer any QoS (Quality of 
Service). 


Figure 3: Phone-to-Phone call via IP (Internet) 


An alternative solution in VoIP is a hardware version of the 
IP phone. With this, users could gain a number of new 
services and applications. While making phone calls 
according to the scenario in Fig. 3 is very suitable for 
individuals such as home users, the IP phone could prove 
very interesting for big companies and organisations. The 
use of IP phones could bring them considerable benefits. 
The most important benefit is a reduction in costs for 
implementing, managing and maintaining one network only 
instead of two. Consider a large company with Intranet 
implemented on a LAN (Local Area Network) with 
dislocated offices connected through a WAN (Wide Area 
Network) of leased line data links. As leased lines are quite 
expensive and mostly unoccupied, the transmission of voice 
over these lines enhances traffic throughput. Tests 
performed in the Laboratory of Communication Devices [1] 
showed that 10 IP voice channels can be transmitted over 
40 % preloaded 10 Mbit Ethernet based LAN with 


10” Mediterranean Electrotechnical Conference, MEleCon 2000, Vol. I 345 


acceptable quality. Figure 4 shows an example of an IP 
voice connection in a company with an internal LAN and a 
dislocated office connected to it over a2 Mb WAN link. 


A complete solution appears to be the most expensive, as 
the initial expenses are fairly high for a company, which 
has already, installed a telephone network. 


ISP VoIP 
gateway 


Internet, Intranet, 'P router 
Extranet 


‘alog 
hone 


An. 
VoIP adapt pl 


IP phone 


Figure 4: A complete VoIP solution 


However, if we consider the major savings in telephone 
charges and network management, such a solution can be 
economical in the long term. Many major 
telecommunications equipment manufacturers are already 
offering complete VoIP solutions, ranging from simple IP 
telephones and adapters for analog phones to gateways and 
gatekeepers. 


II]. STANDARDISATION 


Although end users are usually not concerned with 
standards, it is worth mentioning that almost all products on 
the market are based on the H.323 protocol suite published 
by the ITU-T [2]. An H.323 network, with its functional 
entities, is shown in Fig. 4. 


The H.323 terminal is an end-user device also known as an 
H.323 client. It provides real-time two-way voice, video or 
data communication with another H.323 terminal. The 
client could be a multimedia PC, an IP phone or a terminal 
adapter that connects an analog phone or fax machine to the 
H.323 network. The gatekeeper provides address translation 
and call control services to H.323 endpoints. It is also 
responsible for bandwidth control, authentication, 
authorisation and accounting. The gateway in an H.323 
network performs the actions already described above. In 
short, a gateway connects switched and data networks. The 
connection is possible through conversion of signalling 
protocols and media transmission formats between the 
terminals. In addition, the gateway can operate with other 
ITU networks such as: PSTN, ISDN, BISDN, ATM and 


others. It can also operate as a Multipoint Control Unit 
(MCU). MCU supports multi-conferencing between three 
or more terminals or gateways. MCU is in fact a conference 
server with centralised signalling. 


IV. SERVICES IN IP TELEPHONY 


The Services are the main issue from a user perspective. In 
fact, users don’t need to know how the phone is built, how 
it operates and what kind of network it runs on. They just 
want to use them. 


In telephony, the primary and obligatory service is, of 
course, voice transfer. IP telephony should aim for the same 
QoS as offered by circuit switched networks. The 
supplementary services known in ISDN and cellular 
networks as Call transfer, Call diversion, Call hold, Call 
waiting, Call park and pickup, Message waiting etc. should 
also be provided by IP telephony They are already defined in 
H.450 ITU-T recommendation series [3]. Users will probably 
never switch from an ordinary telephone to an IP telephone if 
the services are inferior. Moreover, the IP telephone should 
offer some additional functionality not available in ordinary 
telephones in order to gain customers. 


A. New functionality in IP telephones 


As the IP telephone is already connected to a data network, 
its connection to a personal computer or dedicated server in 
the same network is seamless. This raises a series of new 
possibilities 


Call identification (CLIP) is already known in ISDN or 
cellular networks. It displays the caller telephone number, 
or his name if the appropriate entry is in the local phone 
book. As an enhancement to this service, an IP telephone 
could query an enterprise or even a public telephone 
directory containing data such as the caller’s name and 
address. Users could thus identify all incoming calls, 
regardless of the entries in their local phone book. 
Similarly, outgoing calls can be made using the public 
telephone directory. 


The connection of an IP telephone to a PC will allow 
management and operation of the telephone from a PC on 
the one hand, and the maintenance of a log of all incoming 
and outgoing calls on the PC on the other. 


Going one step further, for users who do not need a PC in 
their daily work, a luxury version of an IP telephone with 
embedded web browser and e-mail client will be an 
attractive way to provide Internet access. 


With a proper gateway, users anywhere in the network will 
have access to GSM short messaging system, WAP 


(Wireless Application Protocol) and other information 
services. 


Messages, voice and data will easily be broadcast to many 
IP telephones, and so the IP phone could serve as an 
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interphone as well. Conference calls involving users with 
traditional telephones, PC based IP phones and dedicated IP 
telephones will be as easy as ordinary phone calls. 


Users will log on to an IP telephone like they log on to a 
computer. Phone charging could then be made on per-user 
instead on per-phone basis, which would greatly enhance 
user mobility. Many users could log on to the same IP 
telephone to accept incoming calls without needing to 
worry about forwarding calls when leaving their office. 


Logging on to an IP telephone with a smart ID card will 
make the logging on procedure very simple and, will 
furthermore offer a range of new possibilities, from online- 
shopping and payment to accessing a bank account. The 
preferred messaging system for users would be one 
universal (In and Out) box for voice, fax, SMS and e-mail 
messages. The convergence of these services will enable 
users to manage fax, e-mail and voicemail from a single 
application on a dedicated VoIP terminal. 


There is a great deal of possible additional functionality, 
which we can, or cannot yet think of. The functionality of 
the IP phone will depend mainly on the software in the 
phone. The user will be able to download new releases of 
the software (and thus new functionality) from the Internet, 
and in all likelihood he will be able to subscribe to 
automatically upgrade software. 


The survey is the best way to establish which services, 
functions and applications are important to users. Figure 5 
shows the results of a survey conducted by MultiMedia 
Telecommunications Association [5]. The question posed to 
a group of CT (Computer Telephony) users was: “How 
useful are each of the following CT features? (Answer rate 
on a scale of 1 to 10 where 10 is extremely useful an 1 is 
not at all useful)”. As we can see call switching and 
automated routing were identified as the most important 
services, followed by CLIP and screen-pops. Name dialling, 
VoIP and FoIP fared marginally. This example tells us that 
VoIP without supplementary services, enhanced 
functionality and new applications will not be sufficient to 
persuade users to migrate to the new technology. 


Usefulness of CT features 
Call switching to other ES er 
location within org. 


Automatic calll routing 
Caller ID (CLIP) 
Screen-pops 

DNIS 

Dialing by name 

Call switching to other 
location outside org. 
VoIP (Internet Telephony) 
FolP (Fax over Intemet) 
Screen transfer 

Screen sharing 

Other 


Average response 


Figure 5: Usefulness of CT features [5] 


V. CONCLUSION 


The services we described are suitable for implementation 
by corporate networks, ISPs and new generation telecoms. 
VoIP solutions are already in use by many companies and 
corporations and private home users. The cost saving is 
considerable, especially for long distance calls and in the 
event of the construction of new communications 
infrastructure with only one wiring on the company 
premises. Users can immediately upgrade their PCs to 
enable VoIP, with many software solutions available on the 
market today. According to research study [5] many 
supplementary services and applications similar to VoIP are 
already in use today in companies as CT features. Thus the 
next logical step would be to implement such features in IP 
phones as new applications. Such an enhancement would 
greatly increase possibilities for VoIP users, lower the cost 
for equipment and infrastructure, and simplify their 
communication needs. 
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Predictive Playout Delay Adaptation for Voice 
over Internet 


Viktor Matic, Alen Bazant, Miaden Kos 


Absoect— Packet switchims S am cfiective techoolegy fer 
imterrating voice and data im 2 Savile wetwerk. Az example for 
seach 2 Kind of 2 network Ss Internet. New servike Ss emerzing, 
aul it is 2 Voice over Interact Protecel (VelP) Au impertzat 
aspect ef packet-switched wee the recemsirection af 2 
OptinMeErs Stream of speech from the set ef packets that arrive 
at the dGesimaten DIE. Because of the statistical mature of 
packet switchims, each packet cam eaceumier 2 different 
amneent of delay im traversing 2 packet peiwerk. The prodiem 
desorbed im this paper is how t desen 25 apprepriate voice 
FECES Stien Siretesy that repredmces acceptable quality 
specch from packets that arrive with waryims delay. The basic 
idea propesed im the paper is te mensare 2 delzy jitter between 
several pelts on the Interect. Our vee trafic semerater, 
implemented on 2 workstation, semds packets that are Dims 
lees. Receiwer mexsares imterarrival time, keeps track of 
mixed sequences and lest packets. The messurement results 
are Siven im 2 form of averese delzy Sitter. Receiver alse keeps 
2 recerd ef all relevant packet information im 2 text file that 5 
wsed 25 am imput t 2 delzy jitter predicter. Trafic prediction 
element predicts telispart characteristks and directly 
inflwences playout delzy. For 2 prediction clement. we progese 
Adapive Neare—-Purzzy Inference System (ANFIS). 


I INTRODUCTION 


are witnessmes cvoleiom of Inmet m 2 real 
mulnmedie network Even wah all mhenied poor 


more and more of 2 globel traffic on the World Wide Web, 
whach is one of the most popolzr services om the Infemet. 
At ths pomt Iniemet has mo Quality of Service (QoS) 
gumamies. Kk only offers best effumt service, md om the 
other end, apphceton such as realimme woice transport 
Gemunds stict or at least bounded traffic parameters. 
Packet networks have great capebility for 

sod optimal utilization of available bandwidth [3]. i i one 
of the mam reasoms for transporting woe over packet 
networks. A gumber of techmcal problems exist m sending 
ae premeniy omented t recomsiractimg voice stream om 2 
receiver side [4], [I]. Statsncal mate of packet based 
WERSHESSION CARSSS Varying packet imterarrival time (jitter). 


If packets of 2 given call ae routed midcpendenily, 
where cach of them may follow a different path through the 
network, packets may amive at the destination out of onder 
charactenmsiics, the only way t ovemide jitter 5S i smooth 
ee ee 

iter iS 2 problem for micreciive voice apphcaioss, 
winle for sreamme apphcatons ¢.¢. disimoe leammg, we 
cam always mitrodiace additional pleyout delay that is large 
enough, and @ cam compensate jitter, while sill keepms 
user uaaware of that. 
[10] All of them are based on additional delaying of 
received packets. The major question is what is the opamal 
delay. If we chose 2 pleyout delay that is to bave, # would 
detenorate specch quality im such 2 mammer that echo and 
speaker's woice wall overlap. On the other hand if iter s 


The best way is to hitp track of all chien 
osciiamens dummg session and t ocontimmously adapt 
playout deley. This methodology is possible only if woace 
tWamsmuiter docs mot sencrate packets dumme silence 
periods. So, alll the changes in delay on receiver side Should 
take place between telkspurts [2], [3], [10]. Am end user will 
mot notice them because the amount of changes is small and 
they seem hike longer or shorter pauses im specch. With this 
approach we are sure that the same application will work 
with best possible quality under all conditions im 2 network. 

Te acineve this goal we need t know charactenstics of 
jitter mm advance. Hence, main component of smoother on a 
receiver sade is 2 jatter predictor. in tis paper we present 


For design and testme purposes we have used real 
twaffic captered from Intemet rather then simulated one [6]. 


TL LEARNING AND TESTING DaTA 


The Kies wes to eomilate real voice applications, but to 
avoid 2 commection setup. Emulation conforms to Real-time 
Transport Protocol (RTP) [1], using User Datagram 
Protocol (UDP) on the transport layer. To generate learning 
and testing dat we were using small wansmiter 
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application, that sends equally time spaced UDP packets, 
with raster of 20ms. Application sends a set of 5000 packets 
of different lengths (40, 60, 80, 100 and 512 bytes) for 
throughput testing. Different packet lengths correspond to 
different degrees of voice compression and/or different 
sampling frequencies. Measurement for each set of packets 
is repeated 5 times during working hours of day. Moreover, 
the measurement was repeated in two different directions 
(Dublin - Zagreb, Graz - Zagreb). 

Standard Unix command snoop was used as a receiver. 
It captures interarrival times, sequence number and number 
of received packets. 

Based on the number of sent and received packets we 
can choose the best (minimal loss) packet length. Averaged 
results approximated with cubic spline are shown in Fig. 1. 
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Fig. 1. Percentage of received packets (5000 packets were sent). 


For various links and various periods in a day this 
function can look quite different. But we can still make 
some conclusions: shorter packet length introduces lower 
loss rate. 

If packet loss is less then 10%, speech quality is 
acceptable. For learning predictor we have chosen series of 
captured data for packets that are 40 octets long. 

Before actual training process we have to calculate jitter 
by formula given in RFC 1889 [1]. If S; is the RTP time 
stamp from packet i, and R; is the time of arrival in RTP 
time stamp units for packet i, then for two packets i and j, D 
may be expressed as 


Dii, j) =(R, — R,)-(S; -S,) =(R, -$;)-(R,-S;) 1) 


The interarrival jitter is calculated continuously as each 
data packet i is received from source, using this difference 
D for the packet i and the previous packet i-1 in order of 
arrival (not necessarily in sequence), according to 


((DG-1,i)|-J) 


16 


J=J+ (2) 


This algorithm is the optimal first-order estimator. Gain 
parameter 1/16 gives a good noise reduction ratio while 
maintaining a reasonable rate of convergence. 

It is possible to use prediction techniques directly on 
measured interarrival time. There is a reason why we did 
not do it. First, we do not need to predict exact time of 
sequential packet arrivals, but we are interested in trend in 
which changes are happening. Second, it is difficult to train 
predictor with such data because they are too noisy. 

Jitter for direction Dublin-Zagreb is shown in Fig. 2. 
The jitter is an input for the predictor. As we shall see later, 
this data have to be formatted in a special way so that they 
can be used for neural network training. 
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Fig. 2. Jitter for captured 40 octets UDP series. 


III. ANFIS PREDICTOR 


Adaptive Neuro-Fuzzy Inference System (ANFIS) can 
be used as a prediction element. Fundamentally, ANFIS 
uses fuzzy inference system (FIS) and tunes it with a 
backpropagation algorithm based on some collection of 
input-output data. This allows fuzzy system to learn and 
adapt to specific data. Network structure facilitates the 
computation of the gradient vector for parameters in a fuzzy 
inference system. Once gradient vector is obtained, we can 
apply a number of optimization routines to reduce an error 
measure (usually defined by the sum of the squared 
difference between actual and desired output). In the neural 
network literature this process is called learning by 
example. 

~The idea is to take Sugeno-fuzzy model and then to 
create functionally identical adaptive network [5]. So, for a 
simple example we will take Sugeno-fuzzy system with two 
inputs and one output. For the first-order Sugeno-fuzzy 
model [5], a common rule set with two fuzzy if-then rules is 
the following: 


Rulel: If x is A,and y is B,, then f, = pyx+q,y+n, 
Rule2:If x is A,and yis B,, then f, = p,x+q,y+n, 


Based on the fuzzy model we can build network shown 
on Fig 3. Now we have to define node functions. 
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Layer 1 2 3 4 5 


Fig. 3 Sugeno-fuzzy model equivalent ANFIS model 


Every node i in layer 1 is an adaptive node with node 
function equal to 


Oj; = HA), 1 {1, 2 or 


: (3) 
O1, =p, 0) 1€B, 4}, 


where (x) is parameterized membership function, and it 
can be generalized bell function, as in 


1 
Ona o 


Every node in layer 2 is a fixed node, whose output is 
product of all input signals 


Oz; =w; = H4,(X)HB,(), ie {2} 6) 


Every node in layer 3 is a fixed node defined with 
function given with equation 


One pent) (6) 
WwW) +W2 


This node is also called normalized firing strengths. 
Layer 3 nodes are adaptive with node function 


O,, = wif, =wi(p,x+q,yt+n), (7) 
Parameters in a set {P:4ish'> are referred to us as the 


consequent parameters. Finally, layer 5 consists of one 
fixed node that computes the overall output 


Lipps 
0, LT 


(8) 


Thus we have constructed adaptive network that is 
functionally equivalent to a Sugeno-fuzzy model. Note that 


the network structure, shown in Fig. 3., is not unique; there 
is an option to combine layer 3 and 4 and to obtain an 
equivalent network with four layers only. 


A. LEARNING 


We can treat a problem of jitter prediction like nonlinear 
system identification. We need to extract dynamic process 
model to predict x(f) using past two candidates x(t-2) and 
x(t-1). For a learning process we have to reformat our 
training data shown in Fig. 2. All the captured interarrival 
times are placed in a vector j(n), n=1, 2, ..., 5000. Hence, 
for the learning purpose we present this data in a form given 
in (9). The complete series of captured data is divided in 
sets, and each set has three consecutive members, e.g. {j(1), 
j(2), j3)}, (2), J(3), F(4)}, etc. A general approach is to 
train the network with a combination of input and output 
values. Format needed for training is a vector where all 
members are inputs, except the last one that is an output 


[x@¢=2);x=D;x@)] (9) 


where values x(t-2) and x(t-1) represent inputs and x(f) is an 
output. 

Before network training begins initial fuzzy membership 
functions for two inputs must be known. Function for the 
input x(t-2) is shown in Fig. 4.(a). Each of the inputs has 
two associated member functions, which are generalized 
bell functions (4). 


0.02 0.025 0.03 0.035 0.04 0.045 
input variable “x(t-2)" 
(a) 


mf1 mf2 


0.02 0.025 0.03 3.035 0.04 0.045 
input variable *x(t-2)" 
(b) 


Fig. 4. Fuzzy membership functions before learning process (a) and 
after learning process (b). 


For this case, ANFIS has 21 nodes, 12 linear and 
nonlinear parameters, which gives us a total of 24 
parameters. Fuzzy set is defined by 4 rules. We have used 
500 samples of captured data for learning, but to avoid 
starting transition process we have started from 100" 
sample. Learning in 10 epochs gave us a new, modified, set 
of membership functions (MFs) as a result, shown in Fig. 
4.(b). Calculated MFs could be represented as three- 
dimensional surface shown on Fig. 5. 
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0.02 0.02 


Fig. 5. Fuzzy implication function 


IV. TESTING 


Now, we have designed and trained a network (using 
500 samples) for a set of captured data on relation Dublin- 
Zagreb. For testing purposes we can check how prediction 
works on the same set of data but on the rest of the segment 
(after 600" sample). Predicted and captured data are shown 
on the same diagram in Fig. 6, making it obvious that the 
difference between these two data sets is small. 
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Fig. 6. Segment of predicted and captured data 


Exact error function is given in Fig. 7, showing that 
prediction works fine on all the received data. Maximum 
error is less than 4ms, what is satisfactory result. 
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Fig. 7. Comparison error captured data and predicted 


We have also tested this network on data captured on 
switched LAN and on relation Graz-Zagreb. Results for 
LAN measurements are shown in Fig. 8. 


V. CONCLUSION 


As it is proved through testing, ANFIS can be used as a 
prediction element in the jitter smoother. We have shown 
that the same neural network can be used under different 
network conditions (LAN and different locations on the 
Internet). But in real implementations, prediction element 
should be constantly trained to network conditions to 
achieve the error reduction in prediction method. 
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Fig. 8. Segment of predicted and captured data in LAN environment. 


This can be easily accomplished as a background 
process in an application. In a real implementation the 
smoother would be realized to predict  talkspurt 
characteristics rather then every packet interarrival time. 
Output of predictor should give exact time to start playing 
first received packet of the last talkspurt. Playout is delayed 
by predicted time in relation to pause between talkspurts. 
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COMPUTER ANALYSIS OF LOS 
MICROWAVES LINKS CLEAR AIR 
PERFORMANCE 


Tudor Palade, Member, IEEE 


Abstract - Microwave links have to be designed such that 
propagation effects do not reduce the quality of the 
transmitted signals. Measurements and the _ derived 
propagation parameters are analysed and discussed, for Cluj- 
Napoca county, in order to improve future planning of the 
radio links. 


Index terms — multipath, fades statistics, geoclimatic factor. 


J. INTRODUCTION 


The line-of-sight radio links network has to be 
designed to meet the very high quality objectives. The 
planning involves careful consideration of the actual path 
and the propagation effects [1]. A precise prediction of the 
expected multipath fading is therefore important. There are 
no previous radio measurements on multipath activity in the 
hilly region of Cluj-Napoca county so, less information is 
available. Our days development of the national and private 
radio networks requires more data on propagation effects in 
the microwave range. 


I]. MULTIPATH MEASUREMENTS 


The Radiocommunications Laboratory of TUC-N 
carried out multipath fading measurements on four analogue 
radio relay links over the year 1995. The links pass mostly 
over hilly areas and they are designed to meet the standard 
recommendations of ITU-R that means there is enough 
clearance. Some characteristics about the links are given in 
Table I. 

The strength of the received field was measured on 
using a multichannel acquisition board. A special Test Point 
three modules software package has been written to control 
the acquisition action, to process the collected data and 
finally to produce the predicted values of the output 
parameters. The main features of these modules have been 
included in Table II. 
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The acquisition equipment takes a sample of the 
AGC signal level each second for each tested radio link 
receiver. A threshold was established for the AGC received 
signal level and a correlation between the magnitude of this 
signal and the received field in antenna was considered. 
Only the level of a new sample, which exceeds the previous 
one with more than 3%, is recorded. Following this 
procedure a database was obtained with files for each radio 
link over the whole period of measurements. Using such 
records the monthly distribution of the signal fading was 
computed and plotted resulting the worth month distribution 
for the individual link Fig.l! and Fig.2.[2] 

From the above-mentioned graphs the fade depth A; 
beyond which the distribution is approximately linear was 
noted and P; , the corresponding point for percentage was 
considered. With these pairs of co-ordinates for the “first 
tail point” the K; individual geolimatic factor has been 
computed according the CCIR Report 338-6, Annex III 
specifications. 

Averaging the values of log Kj; an average 
geoclimatic factor has been derived for our specific 
geoclimatic conditions: K,, = 5.947E-7. 

The values of K; for each tested radio link and the 
co-ordinates of each “first tail point” taken from the 
distribution curves (A; ,P; ), are summarised in Table III. 

Taken into account the resulted average value for K, 
the P, parameter was estimated using the Olsen and Terje 
method [3] and the following value has been obtained: P, = 
0.28. With the obtained value for K and P, the C parameter 
(which takes into account the terrain features) was estimated 
using the ITU-R formula: 


Ke=slOes Pp (1) 


The calculated value C = 7.27 is specific for our 
region and is with 10% greater in comparison with the value 
estimated by ITU-R for general hilly areas. 
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Table I: Radio links characteristics 
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Ground height above sea (m) 
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Antenna diameter (m) 


Measurements periods in 1995 


January-March + June-December 


Table II: Structure of the multipath analyse software package 


Acquisition module 
- radio link characteristics 
- sampling period 

- received signal 


Computing module 


- geoclimatic factor 
- radio links features 
- path profile 

- fade depth A (dB) 
- channel profile graphic 
- percentage of time that 
fade depth A is exceeded using 
for computation Method I and 
Method II. 


Statistical data processing module 


- database produced by the acquisition 
module 


Database files including 

- level of the received signal 
for each channel (mV) 

- tested channel number 

- date (dd-mm-yy) 

- time (hh-mm-ss) 


- received power vs time per channel 
- histogram per channel and month 

- fading probability distribution 

- mean value and standard deviation 
- average worst month 

- geoclimatic factor value from data 


nv a 
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Fig. 1. The worst month cumulative distribution for path 3. 
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Number 
of records 


Received field strength power (dBm) 


Fig. 2. The histogram of the received power levels for path 3. 


Table III. The co-ordinates for the tail point and the value of K for each radio link 


Path Name of the radio link 
ap fe 


Rx_TV1 F2 Cluj-Deag 2. S824E-6 


a Rx_TF_Fl Cluj-Deag 1.4825E-6_ 


emt 1p: | Rx TF FI Cluj-Zalau 
Rx TV Cluj-Zalau 


III. CONCLUSIONS 


A more accurate estimation of the K and P, values 
needs further measurements on the same radio links to 
obtain a multiannual multipath fading distribution and the 
average worth month for each path. 


1.2381E-6 
2.6354E-8 
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A New Microcell Prediction Model Based on 
the Arrangement of the Streets and their Types 


Nata&a NeSkovié, Student Member, IEEE, Aleksandar NeSkovi¢, Student Member, IEEE, and 
George Paunovi¢, Member, IEEE 


Abstract - A new microcell prediction model is presented in 
this paper. Utilizing this model some important disadvantages of 
known statistical microcell prediction models can be overcome, 
especially for the prediction under NLoS (Non-Line-of-Sight) 
conditions. The model itself is based on the arrangement of the 
streets and their types. The five submodels are defined in 
accordance with the classification of the streets. A separate path- 
loss curve is determined over measurement data for each 
submodel. In order to build the model, extensive electric field 
level measurements (in 900MHz frequency band) were carried 
out in the city of Belgrade, for two test transmitter locations. 
The comparison between the data obtained by the proposed 
electric field level prediction model and the independent 
measurement sets, have shown that the proposed model is 
accurate and reliable. At the same time, the algorithm is suitable 
for computer implementation, simple and fast. 


Index Terms — Mobile communications, electric field level 
prediction, microcell, propagation, LoS and NLoS conditions. 


I. INTRODUCTION 


Microcell mobile radio systems are characterized by low 
transmitting power and base station antennas mounted near to 
or below the surrounding rooftops (usually on lampposts or 
utility poles). The coverage area extends along the street, thus 
the name "street microcell" is often used. Microcells are 
attracting much attention simply because they can 
accommodate more subscribers per unit service area than the 
macrocell. The propagation in microcells differs significantly 
from that in macrocells. Smaller cell coverage with lower 
antenna height and lower transmission power results in milder 
propagation characteristics when compared to macrocellular 
systems. 

The prediction of the path loss, i.e. the electric field level, 
is a very complex and difficult task. Many field strength 
prediction models have been proposed in literature [1,2,3]. 
Generally, the prediction models can be either deterministic 
(environment-specific) or statistical (based on empirical data), 
or a combination of these two. 

The deterministic models [4] are based on the principles of 
physics, and due to that they can be applied in different 
environment without affecting the accuracy. Their 
implementation usually requires a great environmental 
database which is sometimes impossible to obtain. The 
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algorithms of these models are usually very complex and lack 
in the computational efficiency. 

In statistical models [1,5] all environmental influences are 
implicitly taken into account regardless of whether or not they 
can be separately recognized. The computational efficiency of 
these models is satisfying, but the main problem of these 
models usually is the accuracy. The main idea of this research 
was to develop the more accurate statistical model [6] with 
respect to the existing microcell prediction models, which 
have to remain not complex at the same time. 

The implementation of the proposed model requires two 
databases which can be easily created: © the building blocks 
database and © the street database within which all streets are 
classified into several categories. There are significant 
differences in the propagation of radio waves in different 
types of streets. Due to that, different propagation formulas 
are derived for each type of street. 

The new microcell prediction model has been realized in 
900 MHz frequency band. The selection of this frequency 
band was made because this band is very interesting for public 
(GSM) radio systems. However, it should be noted that the 
general idea of the proposed model can be easily applied to 
any specific frequency band. 

In section II environmental databases used in the model 
realisation are considered. The proposed model was 
developed and verified over data collected by extensive 
measurements. The description of the measurements is given 
in section III. The realisation of the proposed microcell 
prediction model is fully described in section IV. In the final 
section, the verification of the proposed model is presented . 


II. DATABASES 


Almost all of the electric field level prediction models use 
some environmental databases. Mostly, the accuracy of a 
prediction model significantly depends on the accuracy of the 
environmental databases. However, a good prediction model 
has to ensure satisfactory accuracy in a real time relying on 
the existing databases and eventually, on some additional 
databases not difficult to create. The implementation of the 
proposed microcell prediction model requires two databases. 
The both databases can be easily created over an aerial 
photography (fig. 1) or a precise topographic map. The 
proposed microcell prediction model was developed and 
tested using aerial photography of scale 1:2240. 

The first database is the vector building blocks database 
where all particular locations are classified in two categories 
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(building or not building). The easiest way to do this is to 
make a colour picture of the building blocks over the scanned 
aerial photography. 


B: 
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to main 


i Building blocks a——s Wide side 
e——a Main (LoS) m:+essm Narrow side 


Fig. 1. The illustration of building blocks database and street 
database shown over the aeriel photography 


The second database is the street database within which 
all streets are classified into several categories: © main street 
under LoS (Line-of-Sight) conditions, @ main street under 
NLOoS conditions, ® narrow side street (the width is less than 
12 m), © wide side street (the width is greater than 12 m) and 
© parallel street with respect to the main street (including 
backyards). Each record in the database corresponds to one 
street and includes: type of the street (narrow, wide, main, 
etc.), the coordinates of the first and the last point of the street 
and the width of the street. The street database is valid for one 
transmitter location, only. In the case that the transmitter site 
is changed, a new street database is to be formed. The time, 
necessary to form the street database, manually, for one 
transmitter location, is very short (typically, a few minutes). 
However, in the case where the database of all streets in a city 
exists, it is not a difficult task to make the process of creating 
the street databases of different transmitter locations fully 
automatic. In this manner, the street databases for the purpose 
of this work were created. 


Ill. THE MEASUREMENTS 


During the measurements 1 W CW RF carrier was 
radiated over vertically polarized 0 dBd omnidirectional 
antenna. The carrier frequency was set at 910 MHz. The 
measurements of the received electric field level were carried 
out with an Anritsu Field Level Meter ML522B which 
declared accuracy is of the order of 2 dB. To cope with a great 
quantity of data, Field Level Meter was connected to the 
laptop computer via IEEE 488.2 interface bus. The sampling 
period of this system was chosen to be 100 ms. Vehicle 
positions were recorded manually. In that way, the accuracy 
of the positioning of the order of 1 m was achieved. 

The measuring equipment was driven along the streets of 
the city on an industrial cart. The receiving calibrated half- 
wave dipole antenna was put on the height of 1.5 m above the 
street level. The speed of the vehicle was nearly constant, 


0.5m/s. In order to cope with the real propagation conditions 
the signals were measured only during the day time, taking 
into account the presence of vehicles and people. 

Prior to the measurements, the transmitter site locations 
were carefully selected. Two distant university buildings in 
the city of Belgrade were chosen for transmitter locations. 
Both university buildings are situated in the dense urban parts 
of the city at almost flat terrains, except for several side streets 
with a mild slope. At the transmitter locations, electric field 
level measurements were carried out for three base station 
antenna heights (3.2 m, 6.2 m and 9.2 m), for a total of 52 
separate measurement sets. The test routes were selected in 
such a way to take into consideration all previously described 
types of streets. 

During ten consecutive days of measurements over 
800000 samples of the field level values versus positions were 
collected. The fast fading was eliminated by averaging over 
+10A interval (for 910 MHz, it is +3.5 m). Such an interval 
was chosen to trade-off between the uncertainty of the 
estimation of the local mean and the relatively small 
dimensions of the microcell. By this assumption, the local 
means were estimated twice over two separate measurement 
data sets at each of several hundreds positions. It was found 
that the mean errors are 0.86 dB and -0.15dB, while the 
standard deviations are 2.66 dB and 3.13 dB for the first and 
the second set, respectively. The obtained values are slightly 
greater than the theoretical ones (the mean error - OdB, the 
standard deviation - 2.2dB) [1]. It should be noted that during 
measurements, very high field level dynamics, of about 80 
dB, were observed. 


IV. MICROCELL PREDICTION MODEL 


As explained, the proposed model for predicting the 
electric field level of microcell is based on the arrangement of 
the streets and their types. Five submodels are defined in 
accordance with the chosen classification of the streets. The 
propagation curve for each submodel is determined over the 
data collected by the measurements. 

The first submodel is LoS microcell prediction model 
which is used along the main street in an open LoS condition. 
The propagation formula of this submodel is given by: 


Enn (ding ) = 93 3 ing aly qa ; ae ati q, , digs a qo ( l ) 


where Ba {[dBuV/m] is the electric field level to be estimated 


at a point A (fig. 2.), dj, [m] is the LoS distance from the 
transmitter T, to the point A, while qo,...,g3 represent the 
coefficients of the third-order polynomial which are obtained 
by LMS fitting over the measured data. The numerical values 
of the coefficients qo,...,g3 are given in Table I. This formula 
is valid for distances d),, of up to 500 m. 

Next three submodels are related to the streets in which 
LoS is obstructed, but which have the direct connection with 
the main street at a point where LoS with the transmitter 
exists (for example, point A in fig. 2). These submodels 
correspond to the cases of narrow side street, wide side street 
and the main street under NLOS condition. Fig. 2 illustrates 
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the algorithm of the previously described models. At a point B 
the electric field level can be calculated in the following way: 


EDS (ics ? os )= Be (41.5 )- a, an (4 nios) (2) 


where E2 


nlos 


[dBuV/m] is the electric field level to be 


Ee [dBuV/m] is the electric field level 
at the last directly visible point A in the main street at the 
intersection of the main and the side street at d),,[m] distance 
from the transmitter. Py [dB] is the correction factor of the 
electric field level at a point B in the side street at the d,,/., [m] 


distance from the intersection. The correction factors 


as” are defined by expression: 


predicted at a point B, 


ain (Fivos) = 93° ds 8 9, 3 Doves ar q, Ditos 2 % (3) 
This formula is valid for d,,,, distances of up to 100 m for 
narrow side street, 125 m for wide side street and 200 m for 
main street under NLoS conditions. In the case when the 
distance d,,),, is greater than the previously mentioned values, 
the submodel for backyards and parallel streets should be 
used. The numerical values of the coefficients go,...,q3 are 
given in Table | for each type of street. 


Fig. 2. Electric field level estimation in the case of side streets (B) 
and backyards or parallel streets (C) 


The last submodel is used for the backyards between the 
building blocks and for parallel streets (with respect to the 
main). This submodel is based on the correlation existing 
between the signal attenuation and the total depth of building 
blocks, along the radio path [1]. This assumption is not 
entirely true because the strong signal received at the mobile 
unit comes from the multipath reflected waves, not from the 
waves penetrating through the buildings. However, according 
to the assumption, if the depth of building blocks is greater, 
the signal attenuation is higher. Building blocks database is 


used to calculate the length of a direct wave path being 
obstructed by the building blocks. An additional attenuation 


factor we is obtained by subtracting the measured electric 


field level Ez at point C (fig. 2) from E; measured for the 


same distance d. when there is no any obstacles on the direct 
path between the transmitter and the receiver. In conclusion, 


propagation formula ) i and additional attenuation factor 


ae is used to predict the electric field level at point C: 


Evtos os (Aes d,) = Exe (a.)- ar (d,) (4) 


where Ee [dBuV/m] is the electric field level to be 
estimated in the backyards and parallel streets, and 


ES [dBuV/m] is the electric field level at the distance d. in 


LoS condition. one [dB] is the additional attenuation factor 


based on the building blocks of depth d, [m] (up to 150 m) on 
the direct path between transmitter 7, and receiver C and is 
given by: 


Aroce(4s) = 95°4; +9,°d; +9,-d, +4 (5) 


When the depth of building blocks \is greater than 150 m, a 
25 dB is to be used. The numerical 


values of the coefficients gg, ...,g3 are given in Table I. 
The electric field level has been measured for three 
antenna heights. The 9dB/dec antenna height gain is observed. 


constant value of Osock = 


TABLE I 
POLYNOMIAL COEFFICIENTS OF PROPAGATION FORMULAS 


| Submodels* | gs-10* | got0* [a [ao | 


Eo (é) 90 0651 | 0.6789 -0.2873 
4.343] a 1.0228 127.72 
a, jos (d, os) ‘ 
main 
Een 0.5246 0.4667 128.39 


Ho a a ve 


- model assumes that the transmitter power is 1W and antenna 
height is 3.2 m 


V. MODEL VERIFICATION 


The quality of the obtained microcell prediction model has 
been verified over measured data sets collected for a new 
transmitter site. Also, the electric field level was calculated 
for the new transmitter site. Fig. 3 shows the result of the 
prediction. On fig. 3, the black colour represents the buildings 
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while different shades of gray colour denote different electric 
field levels. Note that the model works quite reasonably: 
propagation is the best through the main street, the signal 
level decrease faster through the building blocks than in the 
side streets, etc. Due to the fact that the proposed model is 
statistical, the prediction process is fast. It takes just several 
minutes on a PC Pentium computer for one transmitter site. 


MB icoo | BB oreo seck<or2 sacks 80<E<84-76<K<80 
WB 72<1:<76 OB 68<<72 9 o4<r<68 BM oo<n<64 H so<ti<on [I s2<0<s6 
WB ss<i<s2 BB sacvcss gocn<ay k<40 ( in dB V/m) 


Fig. 3. Coverage prediction for the area shown in fig. 1. 


An example of the difference between the predicticted 
results and the measured data for one test route in a wide side 
street is shown on fig. 4. It is evident that there is a strong 
autocorrelation between the neighboring samples. The local 
objects with greater dimensions are causing this effect (trees, 
buses, street shops, etc.). However, in total, it was observed 
that the prediction error of the model has a normal 
distribution. The distribution parameters of the prediction 
error for the specific test routes are calculated and shown in 
Table II. 


TABLE II 
DIFFERENCE BETWEEN THE PREDICTED AND THE MEASURED FIELD 
LEVELS (A - MEAN ERROR, 6 - STANDARD DEVIATION) 


A [dB] 
Main street 122 
(LoS conditions) 


Wide side street 
(NLOS conditions) 


Narrow side_1 
(NLOoS conditions) 


According to the obtained results, it is evident that the 
proposed model is accurate and reliable. At the same time, the 


algorithm remains simple in comparison with other statistical 
models which means that it can be easily applied in 
programme realization. It should be also noted that the very 
good performances of the model have been observed also in 
practice, during the real radio-system designs. 


prediction error [dB] 


0 1000 2000 3000 4000 5000 600 
sample 7 
Fig. 4. The difference between the prediction results and 
the measurements (case: wide side street) 


VI. CONCLUSION 


The proper radio system design requires accurate and 
reliable radio channel models, the field strength prediction 
models being the most important. In this paper, the microcell 
prediction model which is based on the arrangement of the 
streets and their types, is presented. The specific propagation 
curves for each of the defined street types are determined over 
data collected by extensive measurements in different 
microcell environments. The main idea was to develop the 
more accurate statistical model with respect to the existing 
microcell prediction models, which have to remain simple at 
the same time. 

According to the obtained results, the proposed prediction 
model could be successfully used in most typical microcells 
as an fast applicable model. The accuracy of the proposed 
model is very good in respect to existing statistical models. In 
more specific local cases, the accuracy of the model can be 
easily improved by additional measurements. 
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Wireless communication employing high altitude 
long endurance aeronautical platforms 


Sa8o Tomazit, Member IEEE, Ale& Vugrinec, Student Member IEEE, Polona Skraba 


Abstract--This article presents a possible alternative solution 
for wireless service providers. Wireless communication is 
currently the fastest developing branch in telecommunications. 
The number of wireless, especially mobile, subscribers is 
increasing at a much faster rate than the number of wire 
subscribers. Today, wireless communication is divided into 
terrestrial and satellite communications, both of which bring 
with it certain benefits and drawbacks. The described High 
Altitude Long Endurance Aeronautical Platforms present an 
intriguing alternative. From the  telecommunications 
perspective, they combine the advantages of both the 
terrestrial and satellite counterparts. When the HALE reaches 
the stage of operating in a stable fashion, becoming 
economically acceptable, reliable and safe, they will offer a 
new mode of providing telecommunication services. 


Index terms--Wireless communication, mobile communication, 
satellite communication, LEO, NEO, and HALE 


I. INTRODUCTION 


In today’s society, people exchange enormous 
quantities of information ona daily basis in every aspect of 
their life, be it job related, domestic or otherwise. The ideal 
communication system should, therefore, be based on PCS 
(Personal Communication System), enabling users to 
communicate with anybody, anywhere at anytime, provided 
that they use their unique personal TNs 
(Telecommunication Number) Such a system can be 
achieved only if wireless and wireline communications are 
closely interwoven. 

Wireless communication is divided into terrestrial and 
satellite communication. Both systems contain both 
advantages and disadvantages. Terrestrial communication, 
especially mobile, is subject to signal dissipation, distortion, 
fading and other mishaps. The spectrum, as well as the 
capacity, for mobile communications is therefore limited. 
The capacity can be substantially increased by means of 
cell shrinking. However, this requires a larger number of 
base stations, mobile switching centers, as well as 
additional links between them. In short, an expansion of the 
terrestrial communication infrastructure as well as its 
deployment would be essential. 

Satellite systems can provide wireless access over 
larger areas with little terrestrial infrastructure. However, 
for communication using GEO satellites, users require quite 
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a large, expensive and inconvenient terminal. Signal delay 
due to large distances between the satellite and the user is 
also a problem that is not easily remedied. Satellite systems 
in low orbits don’t suffer from signal delay, as the distance 
from earth is shorter, but this requires a large number of 
satellites in order to provide coverage for the signal at any 
location at all times. The consequence is an increase in the 
complexity and price of the system. 

Satellite and terrestrial mobile systems have certain 
very positive features and advantages, which were, in fact, 
the reasons for their introduction. However, there are also 
many disadvantages, which are sometimes very troubling 
and inconvenient. Because the disadvantages of these 
systems are due mainly to the nature of radiation and 
propagation of electromagnetic waves, they cannot be 
avoided in the given economic standards. Therefore, a 
completely new concept for communication systems must 
be introduced. 


II. COMMUNICATION OVER HIGH ALTITUDE PLATFORMS 


The solution to be described in this article is very 
unique. It has various names such as HALE (High 
Altitude Long Endurance) aeronautical platform[1], 
HALO (High Altitude Long Operational) aeronautical 
platform [2] or HAAP (High Altitude Aeronautical 
Platform) [3]. It is a completely new concept in building 
an infrastructure for telecommunication system. As such, 
it would unite the advantages of a satellite and terrestrial 
mobile system into the formation of a new system, as well 
as bring in some advantages of its own. The following 
summarizes the most apparent advantages: 

e Mobile terminals should be comparable in weight, 
function and capability to terminals used in terrestrial 
mobile systems (such as GSM, DCS1800, IS-54). 

e = The signal delay isn't problematic as in GEO or MEO. 
Health concerns are relatively low because user 
terminals operate on low output power as well as earth 
stations for uplink, which can be located in rural or 
unpopulated areas. 

e System implementation and growth can be performed 
step-by-step as in terrestrial systems with relatively 
easy upgrades and services. 

e The radio channel is the most important benefit of the 
HALO system because it utilizes free space propagation 
with distances comparable to terrestrial systems. 

e Indoor coverage is substantial and dependent on the 
look angle and operational altitude of the platform. 

e Fading is comparable to LEO satellite systems with 
difficulties arising only on the borders of geographical 
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coverage because of low look angle and possible 
geographical obstacles. 

e The communication infrastructure consists of few 
gateway stations for the coverage of large areas, unlike 
terrestrial systems, which require hundreds of base 
stations for similar coverage. 


A. Properties and types of aeronautical platforms 


High altitude aeronautical platform is a common 
name for an airborne vehicle such as an airplane, airship, 
helicopter or hybrid. The platform should operate at 
stratospheric altitudes from 15000 to 30000 m. Such an 
altitude is far above any civil aviation flight level. For a 
successful platform, the following features should be 
considered: 

e A lower development, production and operational 
cost in comparison with satellite, 

e Acomparable or longer life time than a satellite by at 
least a few years, 

e Sufficient operating time of the station depends on the 
type of platform, 

e Ability to float over a fixed point or to fly on a typical 
circular route, 

e Ability to carry a large multipurpose communication 
payload. 


Worldwide measurements indicate that there is a 
minimum average wind velocity of 20 to 22 km measured 
over longer periods of time at the stratospheric altitudes. 
This data, provided by NASA, can be obtained from 
reference [4]. A typical profile of the wind velocity is 
shown in Figure |, obtained over Milan. 


__.. IND SPEEDS AS FUNCTION OF ALTITUDE AT MILANO LINATE 


ave thea ip Ow tenet Te erate: 


Figure 1: Monthly average wind velocity at Milan 


With a simple formula, the average instantaneous 
power P needed to withstand the wind force applied on the 
platform can be calculate, 


where Pp is the air density, Cy is the drag coefficient, S, is 
the platform’s cross-sectional area and v is the wind speed. 
For more accurate calculations, the platform’s structure and 
shape as a function of different Reynolds numbers should 


be considered. From the above formula, it can be concluded 

that the possible airplanes would not have difficulties 

withstanding the wind force. However, wind force is the 
airship’s largest obstacle as well as the frequently stated 
critique about aeronautical platforms. 

Platforms can be divided into two classes: heavier 
than air and lighter than air. The first class of platforms 
typically consists of airplanes and helicopters while the 
second consists of airships or balloons with a shape similar 
to the Zeppelin class airship. Recent developments in 
aviation technology utilize very strong and light composite 
materials, computer and satellite use in navigation, new 
aerodynamic wing profiles, very efficient internal 
combustion engines or electromotors powered by solar 
energy. New advanced avionics, especially in UAV 
(Unmanned Aerial Vehicles), represent the peak in aviation 
technology and permit realistic opportunities for 
establishing communication platforms. 

Let us review some _ possible contenders for 
aeronautical platforms: 

e DARPA (Defense Advanced Research Projects Agency) is 
a manufacturer of highly sophisticated aircrafts. The most 
intriguing is the Global Hawk, an UAV aircraft with 
astonishing characteristics. With its endurance of 40 
hours, an operational ceiling of 20 km and capability of 
carrying a 2000 kg payload is the most suitable aircraft for 
platform operation. The problem with this aircraft is its 
purpose for mainly military use [5]. 

e Angel Technologies Corporation is a company, which is 
developing a complete communication system HALO 
with an aircraft Proteus. Burt Rutan, better known for his 
unlimited flight around the globe, developed this aircraft. 
Proteus is an airplane with a two to three person crew. It 
has a variety of operational ceilings, ranging from 15 to 21 
km. It can carry similar payloads as the Global Hawk, 
however, with only half the time endurance. Further 
information on the Proteus can be obtained in [6]. 

e ESA (European Space Agency) is also developing an 
UAV aircraft especially for platform use. Currently, 
this project seems to be slowing down, as nothing 
new can be obtained from ref. [1]. 

e NASA (National Aeronautic & Space Administration) is 
manufacturing and developing an ultra light UAV 
airplane under the ERAST (Environmental Research 
Aircraft and sensor Technology) program. Thus far, 
three very similar types of airplane have been 
constructed and named Pathfinder, Centurion and 
Helios. All of the aircrafts are a flying wing concept 
with solar powered electrical engines and several pylons 
for the communication payload. While the first two are 
being developed as research aircrafts, the third one is 
mainly dedicated to the communication platform. The 
astonishing achievement of Pathfinder was its flight to 
the altitude of 80.400 feet, which is an altitude record for 
propeller driven aircrafts [8]. 

¢ HELIPLAT (HELios PLATform) is a project to design an 
ultra light UAV with solar cell propulsion. The project 
was granted to Politechnico di Torino by ASI (Italian 
Space Agency). In concept, it is very similar to the 
Pathfinder [9]. 

¢ — Rotostar is a project from Silver Arrow and the Tel-Aviv 
University to produce a solar powered UAV for 
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stratospheric platform missions. Rotostar is designed as a 
rotating wing aircraft, in concept very similar to 
helicopters. It uses a solar powered thermodynamic 
engine based on the closed Brayton cycle [10]. 


The other type of platform consists of the lighter than air 
aircrafts. In this class of aircrafts, only two company’s 
presented possible airships for platform use. 

e Skysat Communications Network Corporation 
introduced a plan for urban and _— suburban 
telecommunications services via an airship based at 
70.000 feet. Airships should be without a crew and 
powered by conventional energy. With a length of over 
300 meters and a diameter of around 50 meters, this 
would be quite an enormous dirigible. 

e Sky Station International Inc. is the second and much 
more ambitious contender to offer telecommunications 
services via it’s kind of sky ships. The company intends 
to put over 300 sky ships over major US cities. The 47 
GHz frequency band is already chosen and approved by 
the FCC. The sky ships would be 150 m long and about 
60 m in diameter. The estimate is that an Internet access 
of approximately 2 Mb/s via the Skystation would cost 
only a few cents per minute [11]. 


B. Telecommunications architecture and issues 


Typical telecommunication architecture of an HALE 
system is shown in Figure 2. 


Gateway 


Public network ~ 
(PSTN, ISDN, ATM) 


Coverage area 


Figure 2: Architecture of HALE mobile system 


The platform is stationed above a coverage area, which 
is divided to cells or macro cells similar to a terrestrial mobile 
system. This can be achieved with a multibeam antenna 
group. Such a device is usually a group or an array of 
elementary dipoles with individual phase steering. The size 


of the coverage area depends on the minimal look angle Ofnin - 


and the altitude of the platform. The minimum look angle 
should be carefully chosen. For users inside buildings, the 
angle \should be as small as possible for the signal to 
penetrate the walls, while for outside users the requirements 
are the opposite. Under these conditions the coverage area 
should be between 30 to 5O km in radius. The platform is 
constantly moving, rolling and pitching, and the antenna 
should compensate this movement either with phase beam 


steering or mechanical steering. The type of multiple access 
system that will be used depends on the requirements of 
regulation, the frequency band and company policy. For 
example, angel technologies proposed two bands of 28 GHz 
for local multipoint distribution and 38 GHz for microwave 
point-to-point services. As proposed, an 850 MHz band 
would be divided for downlink (300 MHz), uplink (300 
MHz) and guard band (200 MHz for signalization and 
control of the platform). 

Communication equipment on board will be 
minimized to only the essential components of a mobile 
system. These consist of the multiyear steer able antenna, 
antenna interfaces and a multichannel transponder. All 
other components of the mobile system, such as a base 
station (BS), mobile switching center (MSC), billing 
center and interface to the public network, will be located 
on the ground station. This results in the platform serving 
as a relay station similar to some satellite systems 
(example, Globalstar); while the ground station performs 
functions such as handover, channel allocation, roaming, 
and polling, to name some. A brief overview of payload 
and ground station equipment is shown in Figures 3 and 4. 
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Figure 4: Ground station equipment 


C. Capacity of airborne communication systems 


The main advantage of the HALE wireless 
communication system compared to the satellite and 
terrestrial systems is in its path loss characteristics. In Fig. 
5, the typical path loss can be compared to the distance 
relative to each system. In the diagram, three dots can be 
observed representing geostationary, low earth orbit 
satellite systems and the HALE airborne system. The bold 
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line represents path loss in the terrestrial mobile system. For 
free space, it can be assumed that the path loss is inversely 
proportional to the square of the distance, 1/r’, while in 
terrestrial mobile systems, the path loss is a stochastic 
variable depending on the type of terrain. Mainly, this is 
empirically determined and can be written as 1/r’ where N 
is a number between 3 and 5, representing rural areas and 
densely populated urban areas and city centers, 
respectively. When larger areas are being considered, a 
value of 4 is usually assumed [12]. 
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Figure 5: Path loss in free space and on the ground 


The path loss in a satellite system is still considerably 
higher regardless of the improved conditions in free space 
propagation. Even for LEO satellites (for example, 
Iridium), the path loss is comparable to a terrestrial cell 
with a diameter of 20 km. The opposite is true with the 
HALE, whose path loss at 20 km altitude is comparable to 
that of a terrestrial cell with a 4 km diameter. The next 
factor, the Ricean type of fading, considerably improves 
path loss characteristics. Because there is one strong direct 
signal from the platform to the user and many weak 
diverted signals, a Ricean type of fading can be assumed 
which improves the signal strength from 0-20 dB over the 
Rayleigh type of fading, commonly used in planning a 
terrestrial mobile system. 

A short calculation of benefits in path loss and power 
management is available in ref. [3]. The author estimates 
that one HALE platform with a coverage area of a 200 km 
diameter could serve 240.000 subscribers using CDMA 
voice channels. This is rather modest, for estimates of up to 
1 million subscribers per platform have been made. 


III. APPLICATIONS 


A large coverage area offers a lot of new possibilities 
in applications compared to terrestrial systems. Users and 
service providers are well aware of all available services 
such as mobile telephony, e-mail, voice mail, fax, and 
Internet access. But some new applications could be 
associated With the platform system. First, this could be a 
new way to provide broadcasting services DBS (Direct 
Broadcast Satellite), using the platform system to obtain 
two-way communication. This would enormously increase 
public interest in interactive communication (interactive 
TV, Internet). The other application could be providing 
PCS services to users in areas where platform systems are 


economically more attractive than any other system. The 
areas around city centers and urban areas are especially 
attractive for HALE technology. Another factor in favor of 
the platform system is the time needed to establish a fully 
operational system. This could be accomplished in a much 
shorter time than any satellite or terrestrial system. For 
short-term events like the Olympics, world championships, 
or natural disasters, where a lot of converge is needed in a 
short period of time, establishing communication over 
platforms could help lighten the load for the existing 
terrestrial or satellite communication infrastructure. 


IV. CONCLUSION 


In short, it can be concluded that HALE technology 
could serve as a very good basis for establishing a public 
wireless communication system. The platform system 
combines the positive features of satellite systems, due to 
the much shorter distance between the user and platform, 
while, when comparing to the terrestrial systems, the 
platform system avoids signal fading and dissipation but 
maintains comparable path loss. Furthermore, the platform 
itself is more accessible than a satellite, so it can be 
repaired, in case of malfunction, or upgraded with new 
equipment. It is also more convenient for partial 
construction, testing and use in special events. 

Lastly, it should to be mentioned that the HALE 
technology is almost ready to be deployed. Fixing an 
airplane over a certain point is relatively easy, as it can 
always fly in tight circles and compensate for rolling and 
pitching with adaptive antennas. Fixing a balloon is much 
tougher, but balloons have the great advantage of endurance 
in the range of several months. When it will be 
economically efficient and reliable to fix any kind of the 
platform over a coverage area, users will gain a new 
alternative method of communication. Until then, existing 
platforms will remain mostly in military use for 
reconnaissance, observing, command and communication. 
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A Simple Analysis of a CDMA conaes Radio 
System 


Mohammed ABDEL-HAFEZ, and Mehmet SAFAK 


Abstract— A simple analysis of cellular CDMA radio sys- 
tems is presented. This analysis is based on the cumulation 
of the mean power levels of the interfering signals, instead of 
their instantaneous power levels. Such analysis is valid for 
micro/macro-cellular radio system where the received sig- 
nal suffers from Rician/Rayleigh fading. The system per- 
formiance is determined in terms of the probability of bit 
error, processing gain, Rice factor, the number of users and 
the multipaths in an exponential multipath intensity profile. 
The results obtained were observed to agree perfectly with 
those presented in literature. This study provides a simple 
and yet accurate tool in the analysis and design of cellular 
CDMA systems. 


I. INTRODUCTION 


In recent years, considerable research has been devoted 
to the use of direct-sequence coded division multiple ac- 
cess (DS-CDMA) technique for multiuser applications to 
accommodate more users than TDMA and FDMA. For its 
resistance to multipath fading and multiple access interfer- 
ence, CDMA is implemented to serve multi-user wireless 
communication systems. 

The wave propagation in cellular radio channel is influ- 
enced by the path loss, multiuser interference, selective 
Rician/Rayleigh fading (fast multipath fading), and log- 
normal shadowing (slow fading). In DS-CDMA, multipath 
fading is frequency selective in view of the large instanta- 
neous frequency bandwidth required by each user. Ideal 
power control is assumed to eliminate the effects of shad- 
owing and path loss (near-far effect) which cause slow vari- 
ations in the received signal level. The multipath fading in 
radio channel is characterized by Rician/Rayleigh statis- 
tics depending on the presence/absence of the line-of-sight 
(LOS) between the user terminal and its base station. Such 
channel is known to be frequency selective with dominant 
path undergoing Rician fading while the other paths are 
characterized by Rayleigh statistics. 

This paper presents a simplified analysis of a direct se- 
quence spread spectrum cellular radio system, which is ad- 
dressed in [1] and [4]. In [1], the multiple access interference 
is accounted for by the sums of the instantaneous received 
power levels of the signals other than the user signal. This 
model will be referred to, in this paper, as instantaneous 
power based model (IPBM). However, the results in [2] 
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permit us to consider the effect of the multiple-access in- 
terference in terms of the cumulation of the mean power 
levels instead of their instantaneous values. This model, 
which will be referred to as the cumulative power based 
model (CPBM), is considered as a simplifying approxima- 
tion to the IPBM. The results obtained by the two models 
were observed to be the same for engineering purposes [2]. 

The performance of DS-CDMA based cellular radio sys- 
tem is studied in terms of BER using DPSK signaling in 
an environment of AWGN, multiple access interference, fre- 
quency selective Rician and Rayleigh fading. The effects of 
bit energy to noise ratio Bs processing gain, Rice factor, 
multipath intensity profile, and the number of simultane 
ous users are presented. 

The paper is organized as follows, channel and system 
models are presented in section II. A comparative study 
between CPBM and IPBM for both Rician and Rayleigh 
fading channels are considered in section III and IV re 
spectively. Finally, results and conclusions are presented 
in sections V and VI, respectively. 


II. CHANNEL AND SYSTEM MODEL 
A. Channel Model 


Propagation in wireless radio system is largely influenced 
by three different propagation phenomena, path loss, se- 
lective multipath fading, and shadowing. A perfect power 
control is considered to eliminate the effect of shadowing 
and near-far effect. In our analysis we take only the selec- 
tive fading into consideration. 

In a wireless radio environment, the received signal ex- 
periences multipath fading. If spread spectrum is used, 
with chip duration less than the maximum delay spread 
of the channel, the fading channel is considered to be fre- 
quency selective, otherwise the fading is considered to be 
frequency nonselective. In this paper, we consider a gen- 
eral fading channel model, which characterizes most of the 
wireless communication channels. The low-pass impulse 
response of the channel is commonly modeled by a tapped- 
delay line with spacing equal to T, (chip duration). Since 
the maximum delay spread is T;,ax, the tapped delay line 
can be truncated at L = |Tinaz/Tc| + 1 taps, where |=], 
is largest integer contained in x. The delay of the multi- 
path components of the first path are much smaller than 
the chip duration T,. Thus the fading of the first path is 
modeled as a non-selective Rician fading. The remaining 
L—-1 paths consists of multipath components, each of them 
is modeled by a Rayleigh pdf, (see Fig. 1). 


365 


RS 
RSs 
2 
<> 
< 


Sod 
SS 


Z 


XP 
<2 


RRR SSS 
LOSS 
5 
KS 


Xs 


Kee 


> 


EXOD 

Sex 

Mes 

VON 
es 


i 


LS 


(Li-2)Te (Li-1)Te 


Fig. 1. The MIP of Rician channel 


Let 77 be the instantaneous received bit energy-to- 
background noise ratio of the ith user through nth path. 
The dominant path of the ith user, 7, is usually modeled 
by Rician fading 


1+K; 1+ Ki) a+ Kine, 
f.e(t) = —ptex (ee eens) 
Yo: Yoi 
e (2 orf ke) (1) 
Yoi 


where 79, denotes the local-mean value of the ith user 
through Oth path and it is given by the sum of the LOS 
component (7§;_,0g) and diffused components (79; 4:5) : 


101 = Yoiros + Yidisp = + Ks) Wiese (2) 


Here the Rice factor, K;, defined by the ratio of the LOS 
to the diffused components as: 


0 
ies Y0i-LOS 
Oi_dif f 


(3) 


The multipath components for n > 0, are modeled by ex- 
@ 


ponential pdf 
exp | — 
( YOidif ; 


where 6; aiff 18 the local-mean level for the ith user 
through the nth multipath. The total instantaneous re- 
ceived bit energy to background noise ratio of the ith user, 
i, is defined as: 


Be) 
0 TeV 


fap (2) = 


(4) 


mr 
Oi_dif f 


L-1 


n= doe 


n=0 


(5) 
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The multipath intensity profile (MIP) can be modeled by 
various forms, but quite often an exponential model is used 
in a wireless radio channel [1]. Diffused local-mean level of 
ith user through nth path, 7; if, Can be expressed in 
terms of total diffused local-mean received bit energy-to- 
background noise ratio, ¥G;_4:f 7» by : 

Vbsaip¢ = Cn Younis f (6) 
where 3%, is defined for exponential MIP, 


a Vie (0 


SEO 


where ¢; = L; —1 [1]. 

The total ith local-mean bit energy to background noise 
ratio is defined by the addition of the LOS component of 
the dominant path and the total diffused components as: 


ae (8) 


0 Ty 
oi = Yoi-Los + Y0i-dis s = 
S108 Reis. No ith user 


inserting (6) into (2) one gets 


Oo: = (1+ Ki) Boiss (9) 


From (3),(6) and (8), the total power of the ith user is 
written as: 


Yor = (1+ Ki) Vosassf (10) 


Using (9) and (10), 78; can be expressed in terms of the 
total bit energy to background noise ratio as: 


9 _ + Ki) H, 

US= (1+ Kis) 07 

When K; = 0, we obtain the selective Rayleigh fading, 
while for BB = 1, we can have the non-selective Rician fad- 
ing, in case K; = 0 and (3 = 1, we obtain the nonselective 
Rayleigh fading. Note that (11) is a general formula which 


describe the general frequency selective multipath fading 
channel. 


(11) 


B. Multiple-Access Interference 


The system consists of a single isolated cell with a cen- 
trally located base station. There are K + 1 instantaneous 
users that are using direct-sequence CDMA to establish 
a full-duplex communication with the base station. The 
multiple-access interference after the receiver matched to 
the desired signal is usually modeled as additive white 
Gaussian noise. The instantaneous received bit energy-to- 
total noise ratio associated with desired jth user is, [1,2]: 


pops ree es 
7 No+n PT, 1+97!97 
Here, P;T, stands for the instantaneous energy per bit du- 
ration, T}, of the desired signal, with instantaneous power 
P;, No denotes the one-sided spectral density of AWGN, 
7 is the processing gain and P;Tj, is the total interference 


(12) 
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energy per bit. The bit energy to background noise ratio 
of the zth user is denoted by: 


_ AT, 
b= No 


The total interference bit energy to background noise ratio 
is defined by: 


(13) 


K Li-1 


Ti ok ae 


7=0 n=0 


tJ 


(14) 


III. PERFORMANCE ANALYSIS OF RICIAN FADING 
CHANNEL 


The bit error probability associated with transmitter 7 
is 


ip | fra, (rj) P (rg) dr (15) 


where fr, (7;) is the conditional pdf of R; defined by (12). 
P(r;) denotes the BER for non fading case for the consid- 
ered modulation scheme. Here we consider DPSK modu- 
lation scheme with P(r;) defined by 
i el 

P(rj) = 5e7 (16) 

The system analysis is carried out in terms of the BER 
performance for the models IPBM and CPBM. A multipath 
rejection receiver is considered that processes the signal 
received over the zero path, and rejects the signal received 
over all other paths. 

It should be clear that the bit error probability is the 
same for every transmitter and, therefore, the dependency 
on jis removed. Without loss of generality, we consider the 
desired user to be 7 = 0 user. We shall consider the case 
where the MIP is identical for all interferers, yj; = yg, and 


A. Cumulative Pewer Based Model (CPBM) 


This method assumes that the multiple-access interfer- 
ence can be accounted for by the sum of the average power 
levels rather than the instantaneous power levels. This 
means that none of the multiple-access interferers under- 
goes fading. This mean that their Rice factors are infinity, 
K; = o for 1 <i < K, and their pdf’s corresponds to 
delta functions, 6 (7? — 7@;). 

The desired user signal to total interference ratio, R , is 
written as: 


2 PHT» awe 
No+17*Poclb 14+ 17710 
where Fo.J} is the total local-mean bit energy of the cu- 
mulative multiple access interferers, with local-mean power 


defined by Poc = >> Pi}, and yo. be the local-mean bit 
(i,n) 


(17) 
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energy of cumulative multiple access interference to noise 
ratio defined by yo. = >> 7; . Here we have only one 
in 


random variable which denotes the desired dominant path 
bit energy to noise ratio, 9. 

The total interference bit energy to background noise 
ratio is defied as: 


Yor = Kou (18) 
The instantaneous bit energy to background noise ratio of 
the dominant path of the desired user,78, is defined by 
Rician pdf (1). For DPSK the conditional BER due to 


K active multiple access interferers is found by using (1), 
(15)-(17), and 


—ar 1 b 
i € 7 Ig (2vér) Oye = oP (=) (19) 

to obtain the following closed form expression 
P, -K Z (20) 

PRIA Me yA 
where A is defined by: 
0 

A ‘Yoo (21) 


(1+ Ko) (1 +7700) 
B. The Instantaneous Power Based Model (IPBM) 


This model is based on the assumption that all multiple 
access interferers suffer Rayleigh fading, i.e. the Rice factor 
of their dominant path is zero. The analysis for Rician in- 
terferers in this model is omitted due to its complexity and 
unnecessarily long formulations. Assuming that all mul- 
tiple access interference are i.i.d random variables having 
the same MIP, i.e. 70; = 10; = Yu and Li = Lj = Ly 
for {(i,j),1<i,j7 < K,i# J}, the desired signal to total 
interference ratio is written as: 


R= 70 (22) 
Lu-1 K 
Toei ( py, vt) 
n=0 2=1 
jig ih Iie 
where SS Soong represents the multiple access interfer- 
n=0 i=1 


ence. The random variables, 7; , have Rayleigh pdf but ar 
is Rician. 
Let us define the following random variables, 


(23) 


Using similar approach to [1], one can find the pdfs of Y 
as: 
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where 


1d |r 1 
——— ———z (25) 
»= as (IT R 
= ~ 
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YOu 
For accurate calculation of (25), (13)-(15) in [5] could 


be used. 
Following the same procedure as in [2], one can obtain 
the following the pdf of the ratio R : 


fr(r) = 
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For DPSK , substitution of (1) and (16), where r is defined 
in (26), into (15) and using (19) leads to the following: 


(1 + Ko) Ko (z ata 1) 
0 
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) dz (26) 
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The above integral can be evaluated by the Laguerre inte- 
gration method [6]. 


os 


IV. PERFORMANCE ANALYSIS OF RAYLEIGH FADING 


We shall consider the same macro cellular radio system 
as in [1], where the base station is located at the center of 
a single isolated circular cell. The direct-sequence CDMA 
system is assumed to provide access to K +1 mobile users, 
which are randomly located within the cell. All users ex- 
perience frequency selective Rayleigh fading. 


A. Cumulative Power Based Model (CPBM) 


We can follow the previous analysis with desired Rice 
factor to be zero, Ko = 0 (Rayleigh). For IPBM, the fol- 
lowing BER expression can be obtain 


Ly-1K-1 
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where E,, (.) denotes the exponential integral of order n. 
This formulation is similar to (33) in [1]. 
B. The Instantaneous Power Based Model (IPBM) 


With CPBM, the following form the BER is obtained 
from (20): 


1 


[— 


(29) 


2 1 ot 1+ Kyou 
This is similar to that given in [3, Eg.14 — 3 — 10], with 
SNR defined by 


Boyoo 
SNR = —————_— 30 
1+ 771K ou oY 

V. RESULTS 


The results in Fig.2 and Fig.3 show the effect of delay 
spread (number of paths, L ) and the number of multiple- 
access interferers, K, on the system BER for 7 = 100 and 
DPSK using multipath rejection receiver. The effect of 
delay spread, L,, = Lo = 2 and 4, is less for large values of 
the Rice factor, Ko = 7dB, than for small values of Rice 
factor, Ko = OdB. From Fig.3, it can be concluded that 
the effect of the number of multiple-access interferers, K, 
is less sensitive to Rice factor of the desired user. For small 
values of the Rice factor, IPBM and CPBM obviously give 
very close results. This is expected because, for large values 
of Ko (less random), the effect of random multiple-access 
interferers is more significant. 


Effect of Rice factor and Fens gain is shown in 
Fig.4, for L, = Lo = 3, K = 2, == = 20dB, and DPSK. 
For low processing gains and high Rice factors, both CPBM 
and IPBM deviate slightly from each other, while for high 
processing gains, they become almost equal. This is ex- 
pected since, in view of the factor 7 in (17) and (22), the 
processing gain attenuates the effect of multiple-access in- 
terference. It is clear that the CPBM is a excellent approx- 
imation to IPBM for low Rice factors. 


The BER expression given by (28) in this paper, and 
in [1,4], may have some accuracy problems because of the 
exponential integral function involved. Fig. 5 provides a 
typical example to this case which is plotted for K = 5. 
L = 4, Ko = 0 (Rayleigh fading), and 7 = 100 by using 
MAPLE V package. As in this case, (29) provides more 
stable results for large values of L and K. 


VI. CONCLUSIONS 


This paper presents a simple analysis of a direct-sequence 
spread-spectrum cellular radio system by using the CPBM 
The results obtained by using this model were observed tc 
perfectly agree with the results found by the IPBM, as 
presented in [1] and [4]. The whole work carried out in [1 
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Fig. 2. BER against local-mean 5 (4B) with power control , for 


different Rice factors and number of paths; K = 1, 7 = 100 and 
DPSK. 


and [4], including the study of the effects of coding and 


diversity, can be simplified by using the CPBM to obtain 
very similar results. The analysis presented in this paper 
can be applied to downlink and uplink CDMA cellular radio 
systems. The simple analysis is believed to provide a useful 
and easy tool for analysing and designing cellular COMA 
systems. 
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SERVICES OFFERED BY A PACKET- 
SWITCHED CELLULAR IVHS WITH A 
SINGLE RADIO CHANNEL 


Roxana Zoican, Ph. D. 


Abstract: This paper analyzes a radio architecture that 
offers two-way transmission services to [VHS ("Intelligent 
Vehicle Highway Systems''), in a typical metropolitan area, 
using only a single radio channel of 30kHz. There are 
presented the multiple communication sevices that can be 
supported within this bandwidth: datacasting, packet switched 
transmissions to and from vehicles, collection of traffic data 
from probe vehicles and transmissions of emergency messages. 

The paper analyzes two cases: 

- a normal peak-period traffic and 

- a case of a major highway incident (during the 
peak-period traffic) creating abnormally high localized [VHS 
data loads in the central area of Bucharest. 

There are presented the following results and curves: 
expected waiting time vs. location for the reuse factor Q=1, 4, 9 
(with no shadowing); expected waiting time vs. location for 
Q=1, 4, 9 (with 7dB shadowing); the average Delay- 
Throughput curve for the peak-period traffic. 


I. INTRODUCTION 


The network architecture and transmission 
standard for VHS are topic of current discussions. One 
approach is to use existing technologies, preferably even 
existing communication services. Some other approaches 
aim at a dedicated infrastructure for IVHS, with its own 
frequency allocation. 


POLITEHNICA University of Bucharest 
Electronic and Telecommunication Faculty 
Telecommunication Department, ROMANIA 


The paper describes a single channel design that 
can provide an efficient [VHS communication network 
within 30kHz bandwidth. Single-channel solutions are 
attractive because of the current assignment of narrowband 
channels to private or public users (state departments public 
utilities, law enforcement, packet delivery services, taxi cab 
operators, railway companies). Since the [VHS user services 
that fall under the classification of "wide-area" generate 
heterogeneous bursty traffic due to their disparate nature, it 
is critically important to predict the performance of any 
wireless system that is proposed to carry such a demanding 
data load. 

In this paper there are analyzed the performances 
of a Cellular Digital Packet Data (CDPD) system to assess 
its ability to provide these services in a typical metropolitan 
setting. In Section 6, there are reported the results of a 
simulating test scenario, using a mobile system simulator. 
There were focused the two most important cases in 
determining system design viability: normal peak-period 
conditions and the case of a major traffic incident during the 
same peak-period, causing long expressway queues and 
abnormally high localized IVHS data loads. 


Il. ADAVANTAGES OF A DEDICATED IVHS 
COMMUNICATION INFRASTRUCTURE 


Research on sharing communication resources 
among multiple users and services has led to a wide variety 
of techniques for multiplexing, switching and multiple 
access to communication resources in a wired 
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infrastructure. The IVHS communication requirements 
appear to be well served if one manages to efficiently 
combine a random access and frequency reuse in a dynamic 
way for packet-switched networks, thus on a packet-by- 
packet level rather than on a session-by-session basis. 

A IVHS communication infrastructure has several 
advantages over a hybrid communication architecture using 
existing systems: 

e ElectroMagnetic Compatibility (EMC): european 
car manufacturers complained that the GSM cellular phones 
interfere with some anti-lock brakes systems; airlines 
attempt to make their passengers aware of the EMC 
problems in aviation equipmnent caused by walkman radio 
receivers, notebook computers and cellular phones. These 
problems occur when more advanced systems are subjected 
to radio transmissions in different bands. 

e Spectrum Conservation: piggybacking IVHS on 
existing wireless communication services does not imply 
that IVHS communication does not consume spectrum; it 
only does not require a new allocation for the new service. 

e Flexibility of design is a reason to adopt a 
common radio interface for IVHS services. 

e Safety - communication appears to be crucial to 
the safety of the [VHS systems; this implies the need for 
active enforcement of interference protection. 

e Manufacturing costs may become excessive if 
IVHS requires communication through hybrid architecture 
involving multiple communication receivers and 
transmitters. 

An efficient network can be built if one channel is 
made available in a wide area, covering many cells. Novel 
access schemes presented in this paper allow contiguous 
use of this channel, which is in contrast to the cellular reuse 
patterns typically used in telephone nets. The design 
presented below adopts the IS-54 channel spacing of 30kHz 
with 48.6kbit/s QPSK for burst transmission of packets of 
324 bits. The frame duration of 40msec is divided into six 
6.67 msec time slots, each including 260 user bits. Using 
the TDFR ("Time Division Frequency Reuse"), interference 
between transmissions in adjacent areas is avoided through 
transmissions in different time slots. TDFR allows a simple 
handoff mechanism, because carrier frequency changes are 
not required in this system. 


Ill. IVHS DATA REQUIREMENTS 


In order to compute the wide-area IVHS services 
data requirements, the whole message set and its structure 
had to be developped, plus the message lengths and 
frequencies of use had to be estimated. The overall data 
load was computed based on those values. As with voice 
traffic, the vehicular and data traffics naturally vary with 
the time of day. Since by design, a communication system 
must accomodate the "busy hour", it will be focused mainly 
the peak-period data requirements. 


The expected system data load is governed by the 
IVHS services market penetration. For that putpose, the 
wide-area IVHS services were classified into five groups 
corresponding to distinct market of user segments: Driver 
Information, Traveler Information, Commercial Vehicle 
Operation (Local and Long Haul), Emergency Vehicle 
Management and Transit Management. The projected 
penetrations and the size of the user pool for each service 
group are shown in Table 1. As an example, if we look at 
the Driver Information entry: for 2 million population (this 
is Bucharest population), a total of 1075000 drivers are 
assumed, out of which 20% are expected to have access to 
Driver Information services. 


Tabel 1 


User 
Pool Size 


Penetration 
Estimate 


User Service 
Groups 


Traveler 20% 
Information 220000 
Commercial 70% 
Vehicle 50000 
Operation 
Driver 20% 
Information 1075000 
Transit 90% 
Management 400 
Emergency 100% 
Vehicle 400 


Management 


IV. DATA CASTING 


Datacasting is the transmission service that sends 
messages from the infrastructure to all vehicles, or to 
certain groups of vehicles. As the number of message 
destinations is large, feedback of acknowledgements from 
recipients can not be used. Base stations send messages to 
vehicles according to a periodic scheme with a cycle length 
of M messages. Base stations in adjacent cells transmit on 
different time slots to avoid excessive interference ("Time 
Division Spectral Reuse"). With a limited system 
bandwidth allocation, the time it takes to transmit an entire 
cycle is inversely proportional to the reuse factor, Q. On the 
other hand, small Q increases the interference between 
calls. 


Performance of Datacasting 

The signal from the i-th base station is subject to 
multipath fading, log-normal shadowing and path loss. 
Because of multipath fading in a macro-cellular network, 
the instantaneous power p typically is exponentially 
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distribute with local-mean, p/. Because shadowing, the 
received local-mean power p/ has the log-normal pdf: 

folpip!) = —+— expt-— tn?) 

V2 xop/ 20° pli 
(1) 

where o 1s the logarithmic standard deviation of the 
shadowing, expressed in natural units and 

p/!=ar8 

(2) 

where r is the propagation distance and a, B are constants. 

For narrowband radio, the probability of succesful 
reception can be approximated by the probability that the 
C/I (Carmer-to-Interference) ratio is above a threshold z. 

The probability that the signal from the nearest 
base station is at least a factor z above the joint interference 
power (event S) is: 


-y 6 
;e° ; Z [> 
P(Sir) = |—— [ Ath. >S=—— exp(V 2o0y}dy 


(3) 
where index =1,2,...,6 denote co-channel interfering signals 
and Z (7, s} denotes the one-sided Laplace transform of the 
pdf 7 at the pot s. 

Results in figures 1 and 2 show that the worst- 
location expected waiting time is minimized for Q=4 for 
light shadowing (less than 7dB), and Q=1 for more severe 
Shadowing. Small Q results im a relatively large outage 
probability. If a message is lost, it may be received during 
the next cycle. 


00 O01 O02 03 04 05 O06 OF O08 03 1.0 
Location of Receiver 


Fig. 1.Expected waiting time vs_location 


0 02 04 os 08 1.0 
Distance r 
Fig.2. Expected waiting time vs.location 
for Q=1,4,9. No shadowing. 
for Q=1,4,9. 7dB shadowing. 


V. RANDOM ACCESS 


For randomly arriving messages in the uplink, 
vehicles must compete for time slots in the uplink. The 
objectives for optimizing reuse patterns for ALOHA 
networks are conflicting: large Q ensures little interference 
between cells, so few messages are lost in intra-cell 
interference. This however reduces the available bandwidth 
per cell by a factor Q. For a given user density, the 
normalized offered traffic load increases also by a factor Q. 
It has been shown that the optimum is achieved at QI. 
Splitting the available spectrum into Q (Q=3.4....) subsets 
increases, rather than decreases the interfering traffic load. 
In IVHS networks, there can be distinguished three kinds of 
inbound traffic, each of which require a different operation 
point on the throughput-delay curve. Emergency messages 
require extremely low delay, but message volumes are 
small. Collecting data from probe vehicles requires a large 
throughput without any requirements for retransmission. 
Interactive queries require reasonable delay, with 
sufficiently large throughout. To conserve radio spectmum, 
it appears advantageous not to merge these teletraffic flows 
on the channel. One should rather reserve separate each 
category of traffic. 


VI. CDPD SYSTEM PERFORMANCE 


The analysis will focus on a 70 by 70 km? square 
section of an average metropolitan area (in the centre of 
Bucharest) with an important Commercial Complex, two 
banks and an Exhibition Centre. 

All cells are three-sectored and identical antennas 
are used in all sectors. 
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The delay-throughput curve obtained for this 
scenario (with the mobile systems simulator) is shown in 
figure 3. The figure suggests a value slightly above 0.7 
Erlang to be the capacity of the reverse link CDPD channel. 


Delay [s] 


01 0.2 0.3 0.4 0.5 0.6 0.7 0.8 
Throughput [Erlang] 


Fig. 3. Average Delay-Throughput Curve 


From the delay-throughput curve of figure 3, it can 
be seen that the average delay is 125ms. 

A major traffic incident during the peak-period 
creates an abnormal stress on the communications 
infrastructure. The main consequence of such an incident is 
the queuing of vehicles, with most impact occuring if the 
incident occurs on an expressway. The near standstill traffic 
condition would make impatient drivers request route 
guidance, even if they may have no way out. The effects of 
the incident in the expressway would also spill over to 
nearby surface streets, where more route guidance requests 
would be generated. 

It is considered this extraordinary data load in 
addition to the normal peak-period data load, thus 
maximizing the impact of the incident. Furthermore, as a 
worst case, it is considered that the route guidance requests 
are concentrated in a short period of time. 

The data load in the most affected cell sector 
becomes 0.615 Erlang, up from the peak-period (0.387 
Erlang). In the incident case, only one or a few sector(s) 
will see the higher traffic, while others will see only the 
normal peak-period traffic. 


VII. CONCLUSIONS 


Efficient packet-switched mobile networks reuse 
the radio spectrum in a manner that essentially differs from 
cellular telephony. The paper offered a single-channel 
architecture for IVHS packet communication networks. It 
supports two-directional transmission and allows unlimited 
spatial extension, by reusing the same channel in other 
areas. 


In the IVHS community it is commonly agreed 
that the two-way communication delay requirement ranges 
between 1 and 1.5 seconds depending on the wide-area 
IVHS user at hand. Services like emergency notification 
may require as little as 1s delay, but for the majority of the 
services a 5s delay would defitely be satisfactory. 
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Triple-Frequency Operation for a Hand-Set 
Mobile Telephone with Reduced EMW Hazard 
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Mohammed E. Al-Daghistani, Member, IEEE, and Khadija F. Omran 


Abstract— In this paper an attempt to solve the problem of 
electromagnetic wave (EMW) hazard caused by the radiation 
from handset mobile antenna as well as the problem of the 
handset mobile operation with differently tuned frequencies 
was suggested. A quarter-wave triple-two-element array 
antenna was implemented to fulfill these requirements. This 
antenna configuration was capable of operating with three 
different tuned frequencies 817.5 MHz, 1751.5 MHz, and 1984 
MHz. Moreover the radiation pattern of each tuned frequency 
has a notch in the direction of the handset user’s head. Using 
such array antenna in the handset mobile telephone makes it 
compatible with the worldwide different operating systems 
namely GSM-900, PCS-1800, DCS-1900, and DECT and also 
safer to use than the handset mobile with single monopole or 
helical antenna. 


Index Terms—Mobile Telephone, Antenna, EMW Hazard 


I. INTRODUCTION 


In the course of developing the mobile telephone antenna 
for the EMW hazard mitigation and for worldwide 
operation, several research works were conducted [1]-[8]. 
Some were directed towards measuring the amount of 
EMW absorption by the user utilizing a mathematical 
model; while others concentrated on the antenna design 
alteration. However, in all these new ideas some advantages 
over the monopole antenna, which is considered here as the 
basic design of the mobile telephone have been made. 


The idea of multi-frequency operation was only recently 
presented [9]. A helical and a monopole were used in 
different configuration to yield double frequency tuning 
antenna. The authors inspired this idea and progressively 
work for hazard mitigation as well as different tuning 
frequency operation. A previous research by the authors 
[10] was conducted in a similar line of [9] but using dual- 
frequency two-element array antenna. 


In this work the authors upgraded their work even further 
by utilizing triple-frequency two-element array as an 
antenna for the handset mobile telephone. 


The authors are with the Higher Institute of Electronic, P.O. Box (38645), 
Beni-Walid, Libya. 


II. THEORETICAL CONSIDERATION 


The theoretical verification of the two-element array 
antenna was performed using the equation given by [11]: 


E(6)=2*E,* cost cos(¢) * cos(@) + os (1) 


Where, £, = field radiated by element (1), S =spacing 
between the two elements, ¢ =angle between element line 
and azimuth, @=horizontal angle, and A=time-phase-angle 
difference. 


Antenna 2 


Fig. 1 Schematic diagram of two-element array antenna. 


In this paper a triple-two-element array antenna where used 
with specifications and dimensions listed in Table I. 


TABLE I 
SPECIFICATIONS AND DIMENSIONS OF THE ADOPTED QUARTER- WAVE ARRAY 
ANTENNA. 


p Shift (4 
1751.5MHz | 0.268. | 90° | 
1984MHz | 0.303A 


Fig. 2 shows a three dimensional realization of the absolute 
field radiation pattern ( which is a figure of rotation of the 
two dimensional realization) for the two-element array 
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antenna no. 1, in Table I. As shown in this figure the notch 
is in the direction of the mobile user’s head. 


Fig. 2 Theoretical absolute radiation pattern of antenna array no.1, three- 
dimensional realization. 


III. TRIPLE-FREQUENCY ANTENNA DESIGN 


The triple-frequency two-element array antenna design 
adopted in this work is shown in Fig. 3. Each element of the 
adopted array consists of a monopole inside two concentric 
helical antennae and the array is operating at frequencies of 
817.5 MHz, 1751.5 MHz, and 1984 MHz respectively. The 
dimensions of each element were carefully chosen with the 
help of a commercial antenna simulator, these are: 


/s= 11 mm, /m=37.1 mm, 
1h,=24 mm, D,=7 mm, N,= 5.5 turns, 
Iho>=21 mm, D.=5 mm, N>= 4.5 turns, 


and S=45.8 mm. 
D2 Di 
a 9 
; th lm 
KY ih e—.  § —_» 
iis Y 


Fig. 3 The triple-two-element array antenna configuration. 


The array antenna was fixed on a metal box with 
dimensions analogous to a standard handset telephone. This 
arrangement can be useful in increasing the low impedance 
of the helical antenna [1], [4]. Fig. 4 shows the 
measurement of the return loss for the helical-helical- 
monopole combination. 


1 
an 


Return Loss (dB) 
1 
S 


i] 
_ 
A 


-20 
700 900 1100 1300 1500 1700 1900 210¢ 


Frequency (MHz) 


Fig. 4 Experimental results for the return loss of the adopted triple- 
frequency antenna. 


IV. EXPERIMENTAL SETUP AND RESULTS 


An RF signal generator (100 kHz — 2000 MHz) was used 
to supply 16 dBm output power at the desired frequency. A 
3-dB power divider was used to provide two equal-power 
outputs. To get the desired phase shifts between the 
elements in each array, additional special RF/MW coaxial 
transmission lines of predetermined lengths were used. 
These transmission lines are capable to give 135° for the 
three-helical array operating at 817.5 MHz, 90° for the 
three-helical array operating at 1751.5 MHz, and 80° for the 
three-monopole array operating at 1984 MHz. 


The results of several experiments that were conducted to 
measure the power radiation pattern of the adopted array 
both in free space and with the presence of the human’s 
head are presented in Fig. 5 and Fig. 6. These results were 
compared with the power radiation pattern of a single 
monopole/helical antenna, as a common antenna used in the 
handset mobile telephones. 


Observing these results one can realize the advantages of 
using an array rather than single element in the handset 
mobile telephone. Besides the higher gain it produces, the 
two-element array antenna reveals a radiation beam pattern, 
which is almost flat from the side of the user’s head thus 
reducing the EMW penetration into the user’s head. 


The authors have suggested a design where the adopted 
antenna can practically attached to the mobile telephone as 
shown in Fig. 7. 
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Fig. 7 Schematic diagram of the suggested handset mobile telephone with 
actual dimensions of the array antenna. 


V. TRIPLE-FREQUENCY ANTENNA BANDWIDTH 


Observing the return loss results drawn in Fig. 4, one can 
see that the bandwidth with return loss value over 10 dB at 
the higher frequency range (1984 MHz) was 5.5%, at mid 
frequency range (1751.5 MHz) was 7.5%, and at the lower 
frequency range (817.5 MHz) was 6%. The bandwidths at 
the three above given ranges can not be considered high, 
thus, in case of operating with high bandwidth systems a 
reactive matching circuit is needed to extend the bandwidth. 


VI. CONCLUSION 


From the experimental results obtained we can conclude 
that using an array antenna rather than a single-element 
antenna will produce a power radiation pattern with 
minimum effect on users. Also the ability of using this 
mobile in different operating systems in the world makes it 
very handy and businessmen acquaintance devices. The 
adopted array design is proved to be easy to manufacture 
and follows the dimensions of the common handset mobile 
telephone. 
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Abstract—Operating with multi-frequency and simultaneously 
mitigating the EMW hazard in the handset mobile telephone is 
the aim of this paper. To achieve this goal three arrays are 
constructed, each array, which contains three elements, 
corresponds to a different operating frequency. The operating 
frequencies chosen were 966.8 MHz, 1861 MHz, and 1980 
MHz. This choice enables the mobile to be compatible with 
different worldwide operating systems namely GSM-900, 
DCS-1800, PCS-1900 and DECT. The dimensions of each 
array and hence the power radiation pattern shall contribute 
greatly towards mitigating the hazard caused by the EM field 
radiated usually from the mobile set antenna. 


Index Terms—Mobile telephone, Antenna, EMW Hazard. 


I. INTRODUCTION 


Research efforts in upgrading the performance of a mobile 
telephone without altering its basic dimension and function 
have been recently presented. These upgradings comprise 
first, its ability to operate in two [1], [2], and three different 
frequencies [3], and second in designing an antenna that 
yields a power radiation pattern which is least hazardous to 
the mobile telephone user [2], [3], [4]. 


In a contemporary work by the authors [3], it was found 
that using an array containing two elements, each element 
comprised of three concentric antennae, enhances 
directionality of the system and reduces biological hazard 
on human health caused by penetrating EMW. This newly 
designed array can be operated with three differently tuned 
frequencies. In this work the authors took the matter one 
step further by suggesting a three-element array rather than 
two. The array was constructed using different shape 
elements so that three different operating frequencies can be 
received within the same telephone set. Then one can be in 
various locations in the world and still able to use the same 
hand-held set in all these places. The designed array is very 
useful, easy to implement and the experiments conducted 
reveal a considerable of hazard reduction as compared to 
the monopole case. 


The authors are with Higher Institute of Electronic, P. O. Box (38645), 
Beni-Walid, Libya. Email: Salah_Al Mously@usa.nét. 


II. ARRAY ANTENNA DESIGN 


The theoretical radiated field from the linear n-element 
array configuration, shown in Fig. 1, as a function of the 
angle (¢), can be obtained using the equation listed below 


[5]: 
E(¢) =1+e” +22 +e 4 ely (1) 


where y is the total phase difference of the fields from 
adjacent source elements as given by 


y= “ZA c0s4+6 (2) 


Where d is spacing between the elements and 6 is the phase 
difference between adjacent sources. 


If the phase is referred to the center point of the array, the 
normalized field is 


Rs bs sin(n y / 2) 
"—n sin(y/2) 
Where n is the number of source elements. 


(3) 


Antenna n 


Antenna 2 


Fig. 1 Geometrical arrangement of the linear array antenna. 


The specifications and dimensions of the triple-frequency 
Lai -element array antenna used in this work are listed in 
able I. 
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TABLE I 
SPECIFICATIONS AND DIMENSIONS OF THE ADOPTED ARRAY ANTENNA. 


ens" Spacing Elect. Phase 
(a) Shift (5 baa rema 
966.8 MHz 0.05% 


1861MHz | 0.096A —— 
1980MHz | 0.102 


Fig. 2 shows a Three-dimensional realization of the 
absolute field radiation pattern for the three-element array 
antenna no. 3, in Table I. The three-dimensional realization 
is a figure of rotation of two-dimensional pattern. The notch 
of radiation pattern is in the direction of the mobile user’s 
head. 


Fig. 2 Theoretical absolute radiation pattern of the array antenna no. 3, 
three dimensional realization. 


III. TRIPLE-FREQUENCY ANTENNA DESIGN 


The triple-frequency three-element array antenna design 
adopted in this work is shown in Fig. 3. Each element of the 
adopted array consists of a monopole inside two concentric 
helical antennae. The arrays are designed to operate at 
frequencies of 966.8 MHz, 1861 MHz, and 1980 MHz 
respectively. The dimensions of each element were 
carefully chosen with the help of a commercial antenna 
simulator, these are: 


Im=37.4 mm, /s=11mm, /h,=23 mm, D,=7 mm, N,=5 turns, 
1h2==20 mm D,=5 mm, N>=4 turns, and d=15.51mm. 


The array antenna was fixed on a metal box with 
dimensions analogous to a standard handset telephone. This 
arrangement can be useful in increasing the low impedance 
of the helical antenna [1], [3]. Fig. 4 shows the 
measurement of the return loss for the helical-helical- 
monopole combination. 


am a she at 


Fig. 3 Three-element configuration of the adopted triple-frequency 


Return Loss (dB) 
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Fig. 4 Experimental results for the Return loss of the triple-frequency 
antenna. 


IV. EPERIMENXTAL SETUP AND RESULTS 


An RF signal generator (100 kHz — 2000 MHz) was used 
to supply 17 dBm output power at the desired frequency. A 
combination of two power dividers (10-dB and 3-dB) was 
used to provide three equal-power outputs. To get the 
desired phase shifts between the elements in each array, 
additional special RF/MW coaxial transmission lines of 
predetermined lengths were used. These transmission lines 
are capable to give 100° for the three-helical array operating 
at 966.8 MHz, 85° for the three-helical array operating at 
1816 MHz, and 85° for the three-monopole array operating 
at 1980 MHz. 


Fig. 5 and Fig. 6 show the results of several experiments 
that were conducted to measure the power radiation pattern 
of the adopted array both in free space and with the 
presence of the human’s head. 
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Fig. 5 Radiation patterns of single and three-element antennae at different Fig. 6 Radiation patterns of single and three-element antennae at different 
frequencies in free space. quencies in presence of user’s head. 
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The results were also compared with the power radiation 
pattern of a single monopole/helical antenna, as a common 
antenna used in the handset mobile telephones. 


The reader can realize the advantages of using an array 
rather than single element in the handset mobile telephone. 
Besides the higher gain it produces, the three-element array 
antenna reveals a radiation beam pattern, which is almost 
flat from one side, thus reducing considerably the EMW 
penetration into the user’s head. 


The authors have suggested an antenna design such that it 
can be practically attached to the mobile telephone as 
shown in Fig. 7. 


K 3.74em 


e—— 10.00 em——__1 
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Fig. 7 Schematic diagram of the suggested handset with actual dimensions 
of the array antenna. 


V. TRIPLE-FREQUENCY ANTENNA BANDWIDTH 


Observing the return loss results drawn in Fig. 4, one can 
see that the bandwidth with return loss value over 10 dB at 
the higher frequency range (1980 MHz) was 5.1%, at mid 
frequency range (1861 MHz) was 4.8%, and at the lower 
frequency range (966.8 MHz) was 6.3%. The bandwidths at 
the three above given ranges can not be considered high, 
thus, in case of operating with high bandwidth systems a 
reactive matching circuit is needed to extend the bandwidth. 


VI. CONCLUSION 


1. Using triple-frequency three-element array antenna 
reveals a noticeable advantage over the triple-two- 
element array antenna. The power radiation pattern 


(1) 


[2] 


[3] 


[4] 


[5] 


seems to be much flatter near the user’s head, this 
reduces even further the expected hazard. 

In comparison with the triple-two-clement array 
antenna, the newly developed triple-three-element 
array shows a considerable increase in gain (about 
70%) for all operating frequency bands, while the beam 
width stays almost the same. 
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Abstract -- In this paper we consider the use of On-Line 
Analytical Processing (OLAP) tools in order to provide fast, 
interactive analysis of the Web site traffic. For the purpose of 
analysis of our hierarchically organized Web site, the directory 
of Croatian WWW servers, an OLAP application (OLAWEB) 
has been developed. We describe data extraction from existing 
server access log files, data transformation necessary to prepare 
data for multidimensional analysis, and the storage of the Web 
site traffic data. The basic characteristics of OLAWEB 
application are explained. Using this application, Web site 
administrator and other users are able to get fast answers to 
many unpredictable and complex questions that could not be 
answered by available statistics tools. 


I. INTRODUCTION 


As more organizations view the Web as integral part of 
their internal and external communications, interest in 
accurately assessing the server traffic is increasing. Server 
access logs present a way to gather a certain amount of 
quantitative traffic and usage information. If interpreted 
properly, log files give an insight into the user interest and 
activity, enable the comparison between parts of a site, and 
give information about the server load or unusual activity. 
Log files can assist in site development and marketing 
strategy. 


Problems may arise when complex queries cannot be 
answered by available statistics tools [8]. There are many 
existing tools that generate fixed reports that include a 
sumple aggregate statistical analysis about the Web site 
traffic, but they do not allow ad-hoc analytical queries. 
Their fixed reporting format is not suitable for analyzing 
log data from many perspectives and they give too little 
detail. Many times the analysis software cannot separate out 
the data about the particular part of the Web site. The 
directory of Croatian WWW servers, WWW.HR, is an 
example of a site where administration, site development 
and maintenance depend on the data that cannot be 
satisfactorily provided by available statistics tools. 


The rest of the paper is organized as follows: the next 
section explains the background of and motivation for 
multidimensional analysis of the Web site traffic. In the 
following sections data extraction, transformation and 
storage for the analysis is explained, the results of our work 
are described, and finally, our conclusion is presented. 


II. MOTIVATION 


WWW.HR is a hierarchical, searchable directory of 
Croatian WWW servers [1]. It consists of 15 root categories 
and more than 300 subcategories. The initial 15 root 
categones include: 


About Croatia, 

Business and Economy, 
News, Media, Magazines, 
Sports and Recreation, 
Tourism and Traveling, etc. 


Every communication between a client browser and a Web 
server results in an entry in the server’s access log. The data 
captured in the server log file is stored in the Combined 
Log Format, which includes information about: the IP 
address or hostname of the computer requesting the file, the 
date and time of the request, the URL for the file being 
requested, the protocol used for file retrieval, the size of the 
requested file, the HTTP status code for the request, the 
referring URL, the browser and operating system used by 
the requesting host. 


The statistics are created on a daily basis using available 
statistics software. The log files are then compressed and 
moved to tape. The data available from the log files 
provides statistics such as: 


e the number of requests made, 
total files and kilobytes successfully served, 
the number of requests by type of file, such as 
html, shtml or pdf files, 

e distinct IP addresses served and the number of 
requests each made, 

e the number of requests by domain suffix (derived 
from IP addresses), 

e the number of requests by HTTP status codes, 

e totals and averages by specific time periods (hours, 
days, weeks, months), etc. 


Starting the analysis with such statistics is a good way to 
begin looking for more specific questions to ask. However, 
the Web site administrator often needs answers to more 
detailed ad-hoc queries in order to improve the design and 
functionality of the site. 


In our case, different categories of the directory of Croatian 
WWW servers are sponsored by different institutions, 
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others seek sponsorship. In order to get it, we need concise 
data that proves users’ interest. In communication with the 
sponsors, we have to answer many unpredictable, complex 
queries, that Web log report generators cannot give, such 
as: 

e What is the percentage of hits from a specific 
geographic area at the given period of time on the 
given category? 

e What is the increase im accessing the given 
category from a specific region dumng the last 
week/month/quarter/year? 

e Whatis the estimated site traffic if the existing 
access trend contmues im the given category? 

e How does a certain event reflect on the access to 
the corresponding category with respect to the 
geographic ongin of the request? 

Available statistic tools are not able to satisfy the growing 
number of similar questions. Potential questions that might 
interest the sponsors mM some cases cannot even be 
anticipated. Configuring the tools, writing and correcting 
the scmpts for each new query would be extremely 
cumbersome and error prone. Finally, the log files grow to 
more than 450 MB each month. Makmg those files 
available for getting needed historical data would overload 
the already limited server resources. Accessing the log files 
stored on different media and different locations would- 
complicate the process further. 


Til ON-LINE ANALYTICAL PROCESSING 


As a solution, we propose the use of On-Line Analytical 
Processmg (OLAP) tools [2.3.9] im order to enable a 
dynamic multidimensional analysis of relevant summary 
data extracted from the access log file. OLAP is based on a 
multidimensional conceptual view of data and provides the 
ability to analyze large volumes of data. End users of 
OLAP applications interactively compose their own queries 
using simple user interface similar to a spreadsheet 
interface. Those users should be able to analyze data across 
any dimension, at any level of aggregation, with equal 
functionality and ease. Multidimensional hierarchies of 
aggregate data enable: 


e viewing of subsets of data, 

e navigating among levels of data ranging from the 

most summanzed to the most detailed levels, 

e viewing data from different perspectives by 

dynamically changing the dimensional orientation. 

e comparisons among different data elements, 

e exception reports to highlight unusual situations. 

e time-series analysis to identify trends, etc. 
Multdimensional data can be stored in a proprietary 
structure called multidimensional database or in a relational 
database. However, end users of OLAP tools should never 
need to be concemed about the storage of data, and should 
be able to treat the resulting database as one logically 
coherent multidimensional structure. 


TV. DATA EXTRACTION AND TRANSFORMATION 


Regardless of the kind of data storage that is goimg to be 
used, the beginning of the process of data extraction from 
access log file and its transformation is the same. 

The starting place for the quantitative analysis of the 
WWW site traffic is the access log file [4]. Only three fields 
are sufficient for our analysis: 


e the hostname or IP address of the computer 
requesting the file, 

e the date and time of the request, 

e the URL of the file bemg requested. 


Based on the information retneved from those three fields, 
three dimensions are defined: 


e Geography 
e Time 
e ©Category 


Valid values for the Geography dimension are Croatia and 
the names of the continents from the onginating request. 
However, the users geographic location cannot be 
accurately extracted from log files. The log file stores the 
client’s IP address: it is a unique number attached to the 
machine rather than an identifier of people. The structure of 
the Domain Name System causes problems m designating 
the geographic location. Geographically, domam suffixes 
such as .com, .org and net could refer to commercial 
enterprises, organizations and networks from any country. 
Other suffixes, such as .edu and -gov refer specifically to 
US domains. In the previous example, however, all of the 
mentioned suffixes form the value of North Amenca. On 
the other hand, we are aware of masking user IPs by proxy 
servers as well as presumably serving requests from proxy 
servers or local caches. However, we estimate that those 
problems do not significantly influence the final results. 


A significant percentage of hits may be unresolved in terms 
of reverse DNS lookups (converting IP addresses to domain 
names, thereby providing the necessary suffix to mterpret). 
During the preparation of the access log, some of these 
addresses can be translated to the proper country — for 
example, addresses of the form 161.53.* belong to Croatian 
Internet Service Provider. Similar procedure can be applied 
for other unresolved addresses, but im this example such 
addresses form the value Unresolved. 


The lowest level of detail is defined by the number of 
accesses to a category from a given geographical area m 
one day. Therefore, the Time dimension has the following 
hierarchy: day > month > quarter > year. Similarly, the 
Category dimension has the values of the refermng 
categories plus four additional special categories: Top of 
Directory, Searching, What's new? and What’s hot?. 


Before counting the number of accesses all log entries that 
give information on graphic files are removed from the 
server access log file. The log file is parsed using Perl 
septs, the data are extracted and prepared for loading into 
the database in the following form: 


day, geographical region, category, number of accesses. 
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V. DATA STORAGE 


The prepared data can be loaded into a relational or a 
multidimensional database. 


A. Multidimensional Database 


In the case of multidimensional database [5], data is stored 
in an array structure similar to the programming language 
array. That array reflects the multidimensional conceptual 
view of data that can be visualized as a cube if there are 
exactly three dimensions as shown in Figure |. Data is 
segmented into cells lying at the intersection of dimensions. 
In our example, one cell represents the number of accesses 
from given geographic area at the given period of time on 
the given category of the WWW.RHR site. 


WWW.RHR category 


ZA=7PCoOa 


Kru psre oa 0 


Figure | - Multidimensional view of data 


Dimensions are organized hierarchically, with a possibility 
to have multiple hierarchies for one dimension. Every 
dimension acts as an index for identifying values within a 
multidimensional array. Physical address of individual cells 
can be computed using array addressing. Cells in 
multidimensional structures are often unpopulated, leading 
to a sparse storage. 


B. Relational Database 


The multidimensional conceptual view of data can also be 
represented in a relational database. The “star schema” is 
the simplest structure based on the multidimensional data 
modeling paradigm [6]. It is composed of one table with a 
multi-part key, called the fact table, and a set of tables with 
a single-part key, called dimensional tables. Figure 2 shows 
the star schema for our example of the Web site traffic 
analysis. Each element of the multi-part key in the fact table 
is in itself a foreign key to a single dimension table. The 
remaining fields in the fact table are called facts. 


In the star schema, the conceptual 3-dimensional array of 
Web access data (as shown in Figure 1) is stored in the fact 
table named Web Access as represented in Figure 2. The 
first three attributes of that table are the indices of the 
fourth attribute named access_count. 


Geography 
( dimension | 
table ) 


( dimension 
geography key 
geography_desc 


time_key 

day_of_week 

month — 
quarter Category | 
year ( dimension | 
: table ) 


category key 
category desc 


Figure 2 - Star schema 


The hierarchy is expressed explicitly in the dimension 
tables where hierarchical levels are shown as attributes. For 
example, in the Time table the hierarchy is defined by 
attnibutes: time_key, month, quarter and year. 


C. Discussion 


If data is stored in a relational database, OLAP tools 
generate SQL to extract data. For that reason relational 
database solutions nearly always have slower query 
performance than multidimensional database solutions. 


Multidimensional database solutions deliver sophisticated 
multidimensional calculation functionality that cannot be 
expressed in SQL. On the other hand, relational databases 
are much more scalable to larger database sizes. Therefore, 
if the key issue is performance, multidimensional database 
technology is essential, but if capacity is the limiting factor, 
at least the most detailed data should be stored in a 
relational database. 


In the case of our Web site traffic analysis, the capacity is 
not a problem, so we use a multidimensional database as a 
data store for OLAP processing. 


VI. OLAWEB APPLICATION 


An OLAP application named OLAWEB has been created 
using Oracle Express tools [7]. This application enables 
interactive composition of ad-hoc queries, navigation 
among hierarchical levels of Web traffic data, the viewing 
of data from different perspectives, and other features 
typical of OLAP applications. Using OLAWEB, the Web 
site administrator and other users are able to get fast 
answers to many unpredictable complex questions that 
could not be answered by available statistics tools. For 
example, the Web administrator may be interested in the 
number of hits from 4 top geographic areas in the fourth 
quarter of the year 1998 on the given categories of the 
directory of Croatian Web pages (Figure 3). 


The results of that query can be visually compared for given 
categories and geographic areas. Further, the percentages of 
hits can be seen. 
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Figure 3 - OLAWEB application 


After that, the user of OLAWEB application will be able to 
ask new questions lke the number of hits by months or 
FRE AA i AE EE ta a 
what the mcrease m the number of hits is. The next step can 
be estimating the site traffic for next few months. 


VIL CoONCLUSDDN 


In thas paper we proposed the use of OLAP applications for 
analyzmg the Web ste trafic Such applications provide a 
mulndmensonal vew of data. and give a fledbke and 
imteraciive access to data Data for OLAP applications can 
be stored m relanonal or m mulndimensional databases. 


If a Web ste has am cnommous traffic volume and the 
scalability ts the problem, a relational database should be 
used far Stommg the Web site traffic data In this case. 
Special dstabase designs (Star schemas) represent the 
mulndimensonal concepiial data model. 


Altematively, multidimensional OLAP servers can be used 
for a direct storage of data in special data structures (Le. in 
multidimensional databases). Multdimensional databases 
are optimized for rapid and ad hoc mformation retrieval, as 
well as for fast and flexible calculation of data. 


We have developed OLAWEB application for 
multidimensional analysis of our hierarchically organized 
Web site. The data extracted from the Web access log file is 
stored in a multidimensional database. Using OLAWEB, 
Web site administrator and other users can dynamically 
navigate among different levels of data detail and view data 
from different perspectives, identify trends and estimate 
future site trafiic. 
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TALOS: An interactive Web-enabled 
educational environment on mobile robot 
technology 


S. Athanasiou, I. Kouvaras, I. Poulakis, A. Kokorogiannis, P. Tsanakas and N. Koziris 


Abstract-- The Talos educational environment is an attempt to 
approach the _ constructionist theory of learning. 
Constructionism asserts that knowledge is not simply 
transmitted from teacher to student, but is actively 
constructed to the mind of the learner. Moreover, 
constructionism suggests that learners are particularly likely 
to create new ideas when they are actively engaged in making 
external artifacts that they can reflect upon and share with 
others. The presented system (TALOS) deals with an 
interactive simulation environment for learning the principles 
of mobile robot operation and programming. 


Index terms-- Education, Robot, Simulation. 


I. INTRODUCTION 


Our work (sponsored by the EU MODULATES project) 
aims to support the delivery of technology in the classroom 
environment of both primary and secondary schools [1]. 
Within this context, a course on Mobile Robot Technology 
was developed. Similar efforts [2],[3],[4],[5] in the field of 
distance learning educational materials using simulations, 
address mainly to higher education students. 

Furthermore, all available educational material in the 
market is more or less accompanying a certain mobile robot 
[1], [2]. There are a lot of companies offering educational 
mobile robots, from which only a small portion is intended 
for secondary school pupils. As a result, the design and 
creation of the whole project begun on a nominal basis. 
Special attention has been given to the special needs of the 
students and the teachers of secondary schools. 


The challenge was to avoid becoming over-technical in our 
descriptions, or over-simplistic. A text with many technical 
terms and complex ideas would certainly not be appealing 
for the students. The effort and the anxiety of answering to 
the pupil's questions would become teachers' responsibility. 
On the other hand, if we avoid mentioning. certain 
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fundamental ideas and concentrated on _ gathering 
multimedia resources with astonishing pictures and huge 
videos, the educational value of the whole project would be 
diminished. 


The final decision was so precisely posed several thousand 
years ago from Aristotle: “ The mind is not a pot to fill with 
knowledge, but a fire waiting to be started”; thus we have 
chosen to develop an easy-to-use environment that 
facilitates experimentation on mobile robot operation and 
programming. 


II. THE TALOS EDUCATIONAL ENVIRONMENT 


We have developed our course on top of WebCT [7], a 
web-based publishing environment. This environment 
allows every schoolchild to have his private files referring 
to his progress and other personal information. Other useful 
resources of our system include email for all the students, 
virtual discussion rooms, announcement board, dairy, and a 
full searching system to find information on topics relative 
to the course. The teaching material, was developed using 
dynamic HTML pages, enriched with pictures, audio and 
video clips, along with Java and Perl programs. 


The actual content of the course aims to offer a general 
introduction over Robotics and Artificial Intelligence. 
Fundamental principles are explained through examples and 
topics for conversation. Multimedia resources —video and 
pictures- are used where available. Pupils are encouraged to 
explore a series of proposed links related to the subject 
under discussion. 


The course is organized in eight two-hour lessons. The first 
lesson is a general introduction, explaining fundamental 
concepts and principles of Robotics. The history and 
evolution of robots is also covered. The second describes 
the Pathfinder Mission and the Sojourner Micro-rover. The 
topic was chosen since it is one of the most fascinating 
application of Robotics ever seen. In the next lesson pupils 
examine the design principles Mobile Robots and get 
familiarized with the available sensors and their technology. 
The last theoretical lesson is a presentation of existing 
mobile robots and international robotic competitions. 
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The rest of the course is dedicated to the simulation of a 
virtual mobile robot. It is the heart of the project because of 
its high educational value and the originality. We have 
developed an advanced educational robotic simulator, 
whose target group is secondary school children. A similar 
project, but of different approach, is available from LEGO 
Mindstorms in collaboration with scientists from the 
MIT{[10]. 


III. THE SIMULATION ENVIRONMENT 


Our efforts were focused to provide a virtual robotic 
laboratory, since building, controlling and programming a 
real mobile robot would require expensive equipment and 
specialized teaching personnel. Again, the success of such a 
course would not be guaranteed; most mobile robots in the 
market are programmed in C/C++ (if they can be 
programmed at all), while the problem of pupils sharing the 
same equipment would remain unsolved. In such a 
situation, students do not experiment; they are passively 
receiving knowledge as they would do by reading a book. 


Trying to solve those problems proved to be a strenuous 
procedure, but it provided us with the idea of a completely 
different approach to the subject. We created a Virtual, 3D, 
Interactive Environment called Talos (the first robot ever to 
be constructed, according to ancient Greek mythology). 
This platform consists of : 


e a Virtual Universe (VU), where the robot is situated. 

e a Generic Robot (GR), able to wander within the VU 
and interact with it through a series of sensors. 

e a Visual Programming Environment (VPE), that 
allows students to program the GR using a high-level 
language developed specifically for this project. 


Within this framework, students experiment on different 
robots and scenarios, understand the basic concepts of 
Robotics and Artificial Intelligence, while learning to use 
the computer as a powerful educational tool. A related 
course on Mobile Robot Technology gives them the 
necessary theoretical background and introduces the 
Internet as a supplement to the learning process. 


We will describe the three components of the Talos 
platform and a simple example of a simulation. 


A. Virtual Universe 


The VU (fig.1) is organized and constructed in a way that 
can be easily tailored to different scenarios. For the time 
being, users cannot create their own universes; an editor is 


being developed as a complement to future versions of 
Talos. 


For the implementation of the 3D world, we have used a 
relatively new programming API introduced by Sun as an 
extension of Java 2 (or 1.2), called Java3D. While retaining 
standard Java’s advantages of cross-platform compatibility 
and Internet-orientation, it provides high quality 3D 


graphics. In the early versions of Talos, standard Java 
(AWT) was used; the result was poor graphics and slow 
execution, especially if the program run through the 
Internet. 


B. Generic Robot 


The GR provides us with the capability of creating mobile 
robots that match our exact needs, just like designing and 
building a real robot. We can change the robot’s size, 
shape, color and armament. There is a wide range of 
available sensors to choose such as: laser, microwave radar, 
GPS, camera, IR-heat-touch sensors. Pupils will eventually 
pick not only the correct strategy to solve a problem, but 
also the most efficient set of sensors. 


Fig. 1. A Virtual Universe 


We intend to develop a 3D editor where pupils can drag- 
and-drop a sensor, pick the color of the robot and determine 
its form. This feature further complies with constructionism 
and allows the students to personalize their robots. Ideally, 
every student should have his/her robot. This way the 
primary goal of successfully replacing a real robotic 
laboratory is achieved. Furthermore, we compensate for the 
lack of hands-on experience with a wide variety of sensors 
and scenarios (games) that trigger the pupils interest. After 
all, commercial educational robots have extremely limited 
‘skills’ and any added features such high-tech sensors, are 
extremely expensive — well above the budget of an average 
secondary school. Again, the outcoming profit would not 
match the cost of the investment. 


For the current version of Talos, three different robots have 
been pre-constructed, each with a different orientation. The 
first is a space rover called Odysseus (Ulysses), similar to 
the Sojourner rover NASA sent to Mars in the summer of 
1998. Odysseus would be an example of a high-tech robotic 
application, incorporating advanced sensors and scientific 
equipment. The relative scenario requires from the students 
to program the rover so that it can find all the rocks inside a 
certain area and perform a set of scientific measurements. 


The second scenario is inspired by the RoboCup Initiative 
which is encapsulated into the following paragraph : 
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By mid-21st century, a team of fully autonomous humanoid 
robot soccer players shall win the soccer game, comply 
with the official rule of the FIFA, against the winner of the 
most recent World Cup. 


Students are certainly not going to be the ones to implement 
this vision; they are simply asked to program a mobile robot 
that can find the ball and score. The robot they control is 
called Machlas, named after a Greek football player. 
Compared to Odysseus, Machlas would be far more simple 
and relatively inexpensive (unlike the real player). 
However, this does not make its programming easy. 
Actually, it is a very general introduction to the field of AI. 


The third scenario is the most challenging and interesting. It 
requires from the students to create a working, smart 
vacuum cleaner. The robot that will accomplish this task 
belongs to a fictional industry, and it is called Acme VC. It 
is the least equipped from the above and the most difficult 
to program. The whole idea behind this scenario is to show 
students how robots might be used in the future (this subject 
is widely covered in the course of Mobile Robots, Lesson 
4). 


More scenarios are being developed, featuring complex 
principles of AI. We did not wish to become simplistic; 
similar projects have clearly shown that students can grasp 
complex, yet fundamental concepts with relative ease. 
Proper learning aids and careful guidance are the keys to 
success. 


C. Visual Programming Environment 


Probably the most challenging part of the project, since it 
required our development team to create an easy-to-learn, 
easy-to-use programming language. If needed, more 
complex algorithms should be able to be implemented. For 
the evaluation version of Talos we had developed a Pascal- 
like pseudo-language. 
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Fig. 2. Visual Programming Environment. 


That version had several drawbacks. Students should get 
acquainted and learn that new language, assuming that, 


pupils do not have any programming skills. Furthermore, 
programming robots is usually made using decision trees. 


After extensive experimentation, we developed a visual 
programming interface (fig.2) with commands in natural 
language. Visual, for ease in writing the program. Natural 
language, so that students could program from the first day. 
We will demonstrate this environment through an example. 
What follows in fig. 3, is a program allowing a virtual robot 
to wander in a course and find the obstacles in it. 
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Fig. 3. Programming example 


\f camera not clear 


The robot is carrying a camera looking forward, and a 
radar. If the radar is clear, we take a step forward hoping to 
find an object. If there is an obstacle within the radar range, 
the robot automatically records its position and tums 
towards it. It is now up to the camera, which it has a shorter 
range than the radar. If the camera can see the obstacle, it 
identifies it. Otherwise, we take a step forward in order to 
get closer. 


We have made certain conventions, so that the radar works 
with multiple objects (by focusing to the closest one to the 
robot). We integrating the sensors and the memory of the 
robot, allowing it to automatically avoid objects that have 
been already identified and ignore them as well (as if they 
did not exist). Students will be able to compose this 
program (or if they do not, at least they will understand it). 


IV. SUMMARY 


The Talos platform is a valuable learning aid for secondary 
schoolchildren. Along with the relevant lesson of Mobile 
Robots it can help pupils understand the main principles of 
Robotics, bringing them closer to modern technology. 
Finally, they are encouraged to use Internet as an 
educational tool, since a significant part of the course is 
based on Web resources. 
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Abstract—Current society manifests a growing interest in the 
penetration of new Information Technologies to the daily 
citizens life. Although there are still a few problems to solve, it 
seems that the most efficient way to arrive to the users is 
through a Set-Top Box terminal (STB) connected to a 
conventional or digital TV set. This paper analyzes these 
penetration problems that current solutions represent and 
introduces a new STB approach that adapts to the user profile. 


Index terms-- Information Society, Internet, Multimedia 
Services and Platforms, TV, Broadcast/Multicast, IP 
Networking, Datacast, Set-Top Box, software Tele-load, 
adaptable GUI. 


I. INTRODUCTION 


Users are mainly interested in services than in technology. 
Also, they pay attention to those services that are affordable. 
In addition, the massive pool of users (the big market) 
demand services that are offered to them at the residence 
without an active action from the user itself. Contrary to 
present services where the user has to follow a process to 
connect to a network and access to them. Furthermore, this 
process currently is not seamless or not transparent 
technologically speaking. Therefore, the Information 
Society advances are only enjoyed by a small sector of the 
population and with a low social penetration [1]. This sector 
is around very young people, educated ones, qualified 
professionals, etc., but does not cover a wide range of ages, 
jobs or formation skills. There is a real danger regarding 
technology and the “classical” way to view Internet 
(physical PCs connected to a congested network), since the 
population may be separated, in a short term, into 
“technology illustrated” people and those that are not. 
Taking this into consideration, we propose to design and 
implement a “domestic” multimedia and open-architecture 
terminal used in combination with a TV set. That is, a Set- 
Top Box for the mass market of consumers, but open in the 
hardware architecture as well as in the software and services 
to build over it. It has to be cost-effective and also it is 
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interesting to use a transmission/reception media, cheap 
enough or, if possible, free of charge. We propose a basic 
configuration based on a platform with some expansion 
slots. In-one of the slots there is a daughter board that 
receives and decodes multimedia Data (encapsulated into IP 
datagrams) that is transported in the datacast [2] system of a 
conventional TV channel. It provides the user a down 
channel with a capacity some times higher than some of the 
current solutions to access the Internet. The channel can be 
tuned from aerial emissions or from cable-operators. A 
private channel can be offered to the user using IP and data 
ciphering. The possible services to be offered are enormous, 
one of them being the TV-mail [3] service, which is also 
described in this paper. TV-mail (different from email) tries 
to be an alternative to the delivery of some documents in 
hardcopy paper support, offering at the same time, 
environmental benefits. 


The remainder of this paper is organized as follows. Section 
II gives the entire system overview. In section III the main 
features of the STB proposed are explained. Sections IV 
and V detail in depth the STB characteristics and the 
possible services. Finally, some conclusions are given. 


II. SYSTEM OVERVIEW 


The key issue of the system is breaking the phobias and 
fears that normal people have towards new technologies. To 
achieve this, a closer look to a regular standard home gives 
the appropriate insight: Which telecommunications systems 
everybody has at home? Phone and TV sets. Therefore the 
end-user terminal has to use, at maximum, a combination of 
both currently and massively deployed telecommunications 
systems. 


A second key issue of the entire system is that it uses 
existing and well known systems. IP networks (a “de facto” 
standard) are usually for distributing contents to a server, 
which is responsible of collecting and formatting 
information. Conventional (both aerial and cable) TV 
channels can be used for broadcasting multimedia 
information to a massive pool of users [4]. 


IP data encapsulation is employed to integrate the Internet 
and TV. Using IP datagrams to carry out multimedia data 
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over the Datacast system, already incorporated in the 
conventional TV signals, enables service development 
independently from the higher protocol tower selected, and 
even allows to add mechanisms to measure Quality of 
Service (QoS) end to end, in the prototypes with return 
channel. 


The global system, by nature, is designed to offer a 
transmission channel from a distribution center to the end- 
user home. The system can be divided into three different 
elements: The distribution, the broadcast and the reception 
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Figure 1. Distribution, broadcast and reception parts 


Distribution is achieved by means of a server that compiles 
information from content generation client applications. 
Note that these client applications can be considered a 
distributed system that generates contents to be transmitted 
to multiple users. The server includes scheduling policies to 
decide when to send each information received. This 
information is formatted, compressed and encrypted if 
needed, and finally sent to the broadcast subsystem. 


The broadcast part only modulates the information and 
transmits data through the Datacast subsystem associated to 
a conventional TV transmission unit. Using Datacast 
integrated into a conventional TV video channel it is 
possible to transmit up to 204 Kbps. This is, 30 effective 
bytes per line (discarding control and synchronism bytes), 
34 lines per field and 25 fields per second. Using a TV 
channel dedicated exclusively to Datacast (no video signal), 
it is possible to transmit up to 3,6 Mbps (610 lines per 
field). Almost twice of a basic rate or a primary rate 
narrowband ISDN! - 

Note that this calculation has been performed for a TV PAL 
system. Also, the IP datagram overhead is not computed. 


Reception is performed by the Set-Top Box connected to a 
TV antenna, as described in the following section. 


Ill. SET-TOP BOX OVERVIEW 


The Set-Top Box (STB) is placed between the TV antenna 
(or cable TV) and the TV set. It extracts Datacast data, 
decodes and decrypts if necessary, and stores data in local 
memory. Under user request, information is displayed on 
the TV screen in a user-friendly way. 


The system, from the end-user point of view, has to be a 
“black box” that presents potential interesting information 
on the TV screen. It also has to be very easy to use, 
therefore the system is operated through a standard remote 
control, a well-known device existing in every home. The 
STB proposed is a system based on a general purpose 
Personal Computer (PC) architecture (hidden to the end- 
user). This architecture allows different system 
configurations, according to different user needs. In 
addition, it is cost-effective in comparison to customized 
hardware. 
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Figure 2. Set-Top Box block diagram 


The basic configuration consists of a motherboard with 
several expansion slots and a Mediacast board, specially 
designed and developed by us, to receive and extract 
multimedia data encapsulated into IP datagrams, carried 
onto a conventional TV Datacast channel. This board is also 
responsible of the teleload mechanism. Multimedia data is 
presented on the TV screen like another TV channel. 
Advanced users can select to access information through a 
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browser, a very well known tool for the Internet user, 
achieving some interactivity with a downlink channel only, 
since retrieved multimedia information is already stored in 
the system. With this configuration the information flows to 
the user, that plays a passive role in the activities related to 
connecting to a network. The user can navigate the 
downloaded information through the browser, as the 
information retrieved is local, the navigation speed is 
practically instantaneous. 


A little improvement of the basic configuration incorporates 
a modem to allows a conventional Internet access with a 
return path, but adding the benefit of an extra, almost free, 
downlink channel. Notice that this configuration approaches 
also the concept of a conventional PC connected with a 
modem to the Internet, so this configuration can be used 
both by people that have experience with the Internet and 
people that panic with this word. That is, progressing from 
the basic to the modem configuration, people ends up being 
Internet users without noticing it, in a natural and seamless 
way. 


Besides that, other configurations can be designed very 
easily: The open architecture allows plugging hard drives, 
DVD’s, CD-ROM’s, etc. The operating system of the STB 
is derived from Linux [5], which is open-source code. 
Applications are developed in Java (portable and hardware 
independent platform) [6]. In summary, the entire system is 
open, scalable, expandable, cost-effective and easily 
adapted to future hardware enhancements and new services 


IV. SeET-Top BOX CHARACTERISTICS 


Remarkable characteristics of the presented STB are: 

User adaptable GUI: Through an intuitive Graphic User 
Interface (GUI) that starts at very novice level, different 
users can adapt the STB to their technologic level. The STB 
is able to remember user profiles. 

Open architecture: That makes the STB expandable in 
hardware, so it can grow adapting to new needs, as digital 
TV, future standards, etc., by only incorporating new boards 
for the new function. 

Multiservice: In opposition to the majority of the STB 
systems, dedicated to either analog, satellite[7][8], digital, 
terrestrial TV, ..., and particular applications (interactive 
TV, Internet access, tele-banking, stock exchange, etc.). 
Therefore, this STB is not dedicated to a specific service. 
Teleload: It is one of the most important characteristics. 
Allows software updating, both operating system (OS) and 
applications. This way the STB adapts to the raising 
services. According to new services, new software is 
loaded, without the participation of the user, or, users get 
new services automatically without any additional effort. 
Zero cost: The downlink channel is almost free. Using the 
modem configuration users may have a conventional PC 
style system with an extra, fast (up to 3,6Mbps), free 
downlink channel (conventional TV is usually free in most 
of the countries). With a conventional modem speeds up to 


56Kbps can be achieved (up to 128Kbps using ISDN) for 
the return channel. Obviously, the downlink channel is 
shared among all users as it also happens in the current 
Internet access applications. 


V. SET-TOP BOX SERVICES 


A large amount of new services, only limited by creativity, 
can be provided through combinations of STB, broadcasting 
and return channel use. Starting with a basic configuration, 
the following services are proposed: 


Advertising campaigns: Using broadcasting _ basis, 
advertising arrives to every covered STB. This is equivalent 
to receiving advertising brochures in your regular mailbox. 
This campaigns can substitute the paper based ones, adding 
additional functionality by its multimedia nature, and 
environmental benefits. 


Government campaigns: Similarly, tax period declarations, 
instructions on how to vote, public interest information, etc. 
In general, public institutions information broadcast. 


Private channels over a broadcast system can be also 
offered by using encryption techniques [9]. This principle 
allows services like TV-mail, similar to regular mail but 
using electronic support. This service intends to be an 
alternative to regular mail. The added value is paper saving, 
short time to delivery (practically instantaneous), 
multimedia content in the documents, easy to use, friendly. 
Privacy, authenticity and integrity are supplied through 
cryptography. TV-mail service can be used, e.g., to send 
banking information to the users, like the balance of an 
account, special dedicated offers, etc. 


With a partial use of cryptographic techniques, a Video 
Club service can be provided. Also the distribution of music 
on demand is possible. In combination with a smart card, 
that can be read by the STB, payment methods may be 
available [10] . 


Another important service from the academic point of view 
is the possibility to implement tele-education services, such 
as registration forms, calendars, academic rules, marks, 
course guides, lab exercises or class notes. 


VI. SUMMARY AND CONCLUSIONS 


In the context of the “Information Society” it seems that 
there is an agreement on reusing infrastructures and 
networks, accepting “de facto” standards like IP, and 
developing simple and low cost platforms STB style. For a 
seamless migration to reach the mass market, it is necessary 
to hide technological issues and to facilitate the use of the 
devices employed. According to these concepts, a new STB 
approach that adapts to the user profile has been presented. 
This STB applies to conventional TV and _ telephone 
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infrastructure. It has adaptable GUI, it supports Teleload 
and it is open both in architecture and services to be offered. 
Also, this STB model minimizes the differences between 
people with and without technological skill, resulting in a 
less traumatic social evolution progress. This fact has thus, 
non negligible benefits for the modern society. 
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Functional Description and Engineering of the 
Dolmen HIB Proxy Server. 


G. Vardangalos, A. Christophoros, M. Anagnostou 


The current WWW technology in the Internet is characterised 
by a number of problems. Performance is poor due to the lack 
of suitable caching and propagation methods for frequently 
accessed documents. Finding information is difficult since 
WWW has no built-in directory service. 


In this paper the functionality of a Hypermedia Information 
Browsing (HIB) Proxy Server is described, as well as more 
details about the way it is implemented. In the European 
ACTS DOLMEN (Service Machine Development for an Open 
Long-term Mobile and Fixed Network Environment) project 
two basic ways have been identified of providing Information 
Browsing on OSAM Platform, that is, the Open Service 
Architecture for a mixed fixed and mobile environment. These 
are called the gateway approach and the full OSAM approach. 
It is decided to implement a browsing application based on the 
gateway approach. The gateway approach involves two 
subsystems: 1) HIB Browser running on the end-user's 
terminal, and 2) HIB Proxy Server running on a Service Node. 
The objective of the gateway approach is to provide enhanced 
Web browsing service for end-users on the OSAM platform. 
The TINA sessions and stream communication are used 
between the HIB Browser and HIB Proxy Server. 


Index Terms— DOLMEN, OSAM, HIB Browser, HIB Proxy 
Server, TINA, Jigsaw, Java, HTTP, Service Nodes 


I. INTRODUCTION 


Following the components in the gateway approach, the 
purpose is to provide an enhanced HIB Proxy service based 
on the TINA session and communication models. As 
illustrated in figure 1, the main objects consisting the HIB 
Proxy Service are the permanent objects: Cache Manager 
(the Jigsaw Proxy Server), Browsing Service Factory and 
Web Client Factory; the semi-permanent Web Client object 
that provides Internet connection to a Web Server and the 
private HTTP Proxy Server objects, one for each browser. 
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Cannectiviy Session Objects 


Figure 1: Computational Objects in DOLMEN 


The HTTP Proxy Server offers the HTTP Agent an 
operational interface called i HTTP Request. Through this 
interface all document fetch requests coming from the 
browser are delivered to the proxy. The Proxy Server uses 
the operational interface called i HTTP_Reply at the HTTP 
Agent to deliver ReplyIDs and error messages to the 
browser. The actual data is transferred in a stream that 
supports multiple prioritised flows. 

When the HIB Browser is started, a new instance of the 
HTTP Proxy Server is created, to serve this browser. HTTP 
Proxy Server instances are created by the Browsing service 
factory upon request of the User Agent, after the access 
session is completed. The HTTP Proxy Server implements 
both HTTP specific features as well as the TINA service 
session model. Being a Service Session Manager (SSM) the 
HPS interacts with the Communication Session Manager in 
order to establish a communication session between the 
proxy and the browser. 

When a stream is created between the browser and the 
proxy, the user can start the document fetch. The HTTP 
Proxy Server gets a request to deliver a document (HTTP 
GET method). It first tries to resolve the request from the 
Cache Manager. If the requested document is not in the 
cache, then the HTTP Proxy Server invokes the Web Client 
Factory to get a handle of a Web Client that is serving the 
requested Web Server. If the Web Client does not exist, 
then the Web Client Factory creates a new one. The Web 
Client, when invoked, by the proxy server creates an empty 
locked document in the cache and requests the document 
from the Web Server. When the Web Client receives the 
document, it fulfils the empty document in the cache, 
releases the lock, and informs the HTTP Proxy Server that 
the document is now in the cache. Then, the HTTP Proxy 
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Server can deliver the requested document from the cache 
to the browser through a specific stream channel. 


Il. OVERVIEW OF THE JIGSAW SERVER ARCHITECTURE 


The Proxy Server is a Java process. It is simply the publicly 
available Jigsaw HTTP Server configured as a proxy server, 
without any modifications. The Jigsaw Proxy Server is used 
via HTTP/1.0. If necessary, the Jigsaw Proxy Server can 
easily be replaced with another HTTP Proxy Server. 

It is a W3C's sample implementation of HTTP. It is a full- 
blown HTTP server entirely written in Java. Its design goals 
were: portability so as the Jigsaw server to run on any 
machine running Java, extensibility so as the server to be 
extended by writing new resource objects, efficiency so as 
to minimize file system accesses. By using some caching 
mechanisms, the server will reduce file system accesses, 
eventually to the point that most requested documents 
serving will require no file system access. 


Jigsaw is made of two distinct modules, linked through a set 

of interfaces: 

e The daemon module deals with the HTTP protocol: it 
handles new incoming connections, create new client 
objects, decode requests, and send replies. 

e The resource module is some representation of the 
information space. It is responsible for generating reply 
objects out of the incoming request objects. «> 

These two modules are linked through a set of Java 

interface specifications, so they can. be replaced 

independently, provided they implement adequately the 
interfaces. 


III. ENGINEERING OF THE DOLMEN HIB Proxy SERVER 


Figure 2: Overview of the DOLMEN HTTP Proxy 


The DOLMEN HIB Proxy Server consists of five main 
modules (see Figure 2): 


1. Global Session Control (GSC) takes care of generic 
session control tasks such as establishing streams. 


2. The Proxy Module contains all application logic needed 
to implement the mobile-aware features of the HIB 
application. The Proxy Module does not know anything 
about the DOLMEN Service Machine. 


3. The Communication Module contains all knowledge 
about the DOLMEN Service Machine needed by the 
DOLMEN HTTP Proxy, and offers a simple generic 
communication interface to the Proxy Module. 


4. The Global Service Segment (GSS) a.k.a. Java Adapter / 
Fixed implements the CORBA interface offered to the 
DOLMEN HTTP Agent by the HTTP Proxy. Due to 
implementation reasons, this interface has to be 
implemented in C++, whereas the DOLMEN HTTP Proxy 
is implemented in Java. The problem was solved by 
implementing GSS as an adapter, which forwards all 
received CORBA invocations to the Communication 
Module via an internal CORBA interface. 


5. The Jigsaw Proxy Server functions as a global HTTP 
cache. Since Jigsaw implements HTTP/1.1, it also allows us 
to benefit from the HTTP/1.1 performance improvements 
without having to implement them ourselves. 


IV. BROWSING SERVICE USAGE 


The following is a scenario for getting documents from the 
proxy (Figure 3). In this scenario there are two steps: in the 
first one the user requests a document and the document is 
transferred back to the user. In the second one, the user 
requests another document but it is decided either by the 
user or for other reason that the document is aborted. 

1. The UAP requests a document from the proxy server by 
invoking the request operation to the GSS. 

2. The GSS fetches the document either from Internet or 
from the local cache. It parses the document for inline 
images. 

3. The GSS sends a reply back to the UAP identifying the 
document to be transferred. 

4. As soon as the document is ready for transfer the GSS 
requests the SI to start transferring the document by 
invoking the AddFileInQueue operation. 


5. Data is transferred through the stream flows. 


6. The user requests another document, so the UAP sends 
another request operation to the GSS. Step 2, 3, 4 should be 
repeated. 

7. a) Abort between 2 and 4. The GSS returns a list of 
image URLs that will not be transferred because of the 
abort back to UAP. 

b) Abort between 4 and 5. The GSS should abort the 
transfer of the document through the stream before 
retuming the URL list. 
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8. b) The GSS requests the SI to stop transferring the 
document by invoking the RemoveFileFromQueue 
operation. 


UAP i HTTP AccosRedy+i HTTP AgetComrd+? © GSS i HTTP AcossRaysst+i HTTP ProoCoord soot 
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Figure 3: During Browsing 


V. IMPLEMENTATION OF THE DOLMEN HIB PRoxy 
SERVER 


A) Implementation of Java adapter 


The Java adapter is a separate process from proxy, and it 
communicates with the proxy by means of inter-process 
communication. The method of communication is an 
internal part of the proxy. The interface between parts of the 
proxy is defined as follows (see also Figure 2): a class is 
implemented called HTTPProxy with well-known static 
methods, that forms the interface between the two parts. 
The class consists of the following methods: 


e request(String method, String url, Hashtable attrs, 
InputStream data): processes the HTTP/Access request. 
It doesn’t retum before the request has been 
successfully sent to the Jigsaw proxy server. If the 
request method is “POST”, the method doesn’t return 
betore we sent the data for the operation. The 
parameter method stands for the request method, e.g. 
“GET”, the parameter attrs for the request attributes 
(headers). If method is “POST”, the data for the 
operation can be read from the input stream, otherwise 
it’s null. 


e setImageSizeLimit(int limit): sets the image size limit 
accepted. A size limit of zero means that no images are 


fetched. A negative size limit means all images are 
fetched 


e —setFilters(Vector filterList, Vector paramsList, Vector 
applyToList): This operation passes three Vectors. The 
parameter filterList contains the filter names, the 
parameter paramsList contains the filter parameters and 
applyToList contains the filter restrictions. The filters 
are to be used to implement compression. The filters 
are external programs that the Proxy should spawn as 
subprocesses. For example, if gifs are to be compressed 
to jpegs, the proxy must spawn a subprocess that runs 
the converter write the gif data to the converter, read 
the jpeg data from the converter and write the jpeg data 
to the agent. The setFilters also specifies exactly what 
program should be spawned and with what parameters. 


e setBackgroundImages(boolean b): sets whether 
background images are fetched or not. If the parameter 
b is true then background images are to be fetched, 
false otherwise. 


e setInvariantRequestPart(Hashtable ht): passes a 
hashtable that contains the new request headers to be 
used as a parameter (header names are keys, header 
values are values). The parameter ht is a hashtable 
filled with the invariant request part (strings only). The 
invariant request part is a part of the HTTP requests 
that the browser (e.g. Netscape) sends, is fixed. The 
fixed part consists of "User-Agent: Netscape..." and 
"Proxy-Connection: keep-alive". Since the fixed part is 
known to be unchanged in each request, it is 
unnecessary to send it over the wireless link more than 
once. All that needs to be done in the HTTP Proxy is to 
save the invariant part when it is set, and then add it to 
the attributes hashtable (when the 
HTTPProxy.request(...) method is invoked) for each 
incoming request. 


e abort(String url): aborts the transfer of a document and 
all embedded: documents. The proxy keeps track of 
which inline images belong to which document(s), so 
that the correct inline image fetches are aborted. The 
parameter url is the document to be aborted and the 
method returns a list of the inline image urls the 
fetching of which was stopped as a consequence of the 
operation. 


B) Implementation of the parser 


As we’ve already mentioned, when an HTML document 
arrives it is parsed in order to determine the inline images it 
contains. In order to implement this, we have used the class 
Parse(). It imports three additional classes: 


e HtmlStreamTokenizer is an HTML parser and is 
specialized for HTML streams. This class is useful 
when it is needed to parse the structure of an HTML 


10™ Mediterranean Electrotechnical Conference, MEleCon 2000, Vol. I 397 


document, e.g., downloading a WEB site, indexing a 
file's META tags, verifying links. 


An HTML document can be classified into three broad 
token types: tags, comments, and text. Tags are the familiar 
HTML formatting tags, e.g. <img src="image.gif">, <a 
href="http:/www.yahoo.com">, etc. Comments are 
sequences of characters enclosed by special delimiters, i.e. 
<!-- a comment ->. Text is everything else. The 
HtmlStreamTokenizer interface is based on _ this 
observation. 


e HtmlTag is a helper class to store parsed tag 
information. 


e HtmlException extends Exception. 


VI. CONCLUSIONS 


In this paper the functionality of a Hypermedia Information 
Browsing (HIB) Proxy Server was described, as well as 
more details about the way it is implemented. It is also 
presented how the TINA concept is applied, some 
functional descriptions and engineering issues 


The implementation of a browsing application based on the 
TINA gateway approach is shown. As we said before, the 
gateway approach involves two subsystems: 1) HIB 
Browser running on the end-user's terminal, and 2) HIB 
Proxy Server running on a Service Node. The objective of 
the gateway approach is to provide enhanced Web browsing 
service for end-users on the OSAM platform. The TINA 
sessions and stream communication are used between the 
HIB Browser and HIB Proxy Server. 
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Agent based retrieval of information from 
Internet 


Matevz Pogacnik, Jurij F. Tasic 


This paper presents architecture of a Web-documents search 
system. Its modular architecture enables addition of new 
search modules at any time, personalized selection of 
retrieved documents, and usage of security mechanisms like 
message encryption and authentication. 


agent architecture, information search, meta-data, 
personalization 


I. INTRODUCTION 


Integration of information on the Internet and its retrieval 
has nowadays become a significant problem. 800 millions 
of web pages make a wealth of data readily available, but 
getting the information we want is a difficult and time- 
consuming task. 

In this paper we have focused on the architecture of the 
agent system, which would enable personalised search for 
information (documents), dispersed in the Web. First, we 
are going to show, that today’s Web documents are not 
adequately described and that we need to start using sets of 
meta-data for their description. Then we are going to take a 
look at the system architecture, which should enable 
inclusion of new information sources into the system, 
regardless of their content and structure, without the need 
for significant changes at the user side of the system. User’s 
side of the system should include personalisation 
module(s), which keep track of user actions and enable 
personalised selection and presentation of documents found 
in search process. The security issues are also discussed and 
some solutions in the system architecture are suggested. In 
the end a simple system implementation is presented. 


II. DESCRIBING THE DOCUMENTS 


Successful search of documents requires adequate 
information about content of the documents. A lot of 
today’s search engines use more or less efficient text 
analysis of the documents [7], which very often results in 
strange search results, to say the least. Furthermore, most of 
search engines are returning results that contain very little 
data about found documents, such as document title, its 
location (URL) and maybe date of creation. This 
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information does not allow for any personalisation at the 
user side. 

Taking these facts into account we can make the following 
conclusions. First, the documents need to be described by 
sets of descriptive data, often referred to as meta-data. 
Second, search engines or other interfaces should return 
these meta-data to the user side of the system.. Since the 
automatic text analysis, used by search engines of today, 
has not yet sufficiently matured, the documents should be 
described by their authors or by third parties. 

Nevertheless, the question remains: “Which meta-data sets 
should we use?” Different documents have different 
purpose and usage, like educational, informational or 
entertaining, to name just a few. It seems obvious, that 
using the same meta-data set for all types of documents 
would result in huge amounts of meta-data, describing each 
document, out of which only a few would be useful for each 
of them. Therefore we believe that documents should be 
described by specialized meta-data sets, containing some 
general and specialized information. 

It is also very important that these meta-data sets are 
returned to user side of the system in order to enable 
personalization of search results, which will be discussed 
later in the document. 

At this point we should take a look at system architecture. 


III. SYSTEM ARCHITECTURE 


The search system has been divided into three layers, which 
are: 
e Source layer, contains all the sources of 
information and agent interfaces to those sources. 
e Client (user) layer, is the user's side of the system 
with agents that follow user's habits and behaviour. 
e Intermediary layer, offers accompanying 
services like "agent yellow pages", authentication 
etc: 


Some of many advantages of such architecture [4] are 
possibility of changes on any of the layers, without having 
to change anything on other two layers. In such way each of 
the layers (agents) can focus on tasks it does best. 
Furthermore, different information sources can be included 
in such system, regardless of their internal structure and 
search (query) mechanism. User side agents use 
intermediary layer agents while looking for new 
information sources and for authentication purposes and 


10 Mediterranean Electrotechnical Conference, MEleCon 2000, Vol. I 399 


0-7803-6290-X/00/$10.00 ©2000 IEEE 


access (query) information sources through adapted 
interfaces. 

The concept of three-layer system architecture is presented 
in Figure 1. 


ie z 


| Client (User) layer 


Available service query, 
4 Search requests authenication... 
Information 
(document) 


delivery | 


Intermediary layer | 


Service registration, 
authentication 


Source (information) layer 
bol 


INFORMATION 
(INTERNET) 


Figure 1: three-layer architecture 


A. Intermediary layer agents 


Successfully working system requires, besides service 
(search) and client (user) agents, some intermediary agents 
that provide accompanying services. The most important 
among these services are agent yellow pages and 
authentication. 

Agent yellow pages are very important part of the system, 
because they enable simple inclusion of new information 
sources into the system. Functionality of "yellow pages" 
should be implemented in form of an agent called Directory 
Facilitator (DF) [5]. Service registration requires a certain 
number of parameters that need to be given by registering 
agents [6]. These parameters are: 


e Descriptive name of the service, in our case the 
kind of search service. 

e Mandatory input parameters, accepted by search 
agent. These include type and form of data, which 
need to be passed to the search agent, when 
performing search. (Ex. Keywords:String) 

e Arbitrary input parameters, also accepted by 
search agent (Ex. language:String, 
DocumentFormat:String) 

¢ Output parameters, returned by search agent. They 
represent the type and form of search results, for 
example meta-data about found documents (Ex: 
documentMetaData:IEEE_ LOM) 

e Search agent address, which includes IP (Internet 
Protocol) address and port. 


After an agent registers its service to DF, the service 
description data is stored in its database and is from that 
moment on available to user side agents. When user side 
agents want to find an agent that enables a search service, 
they contact the DF and ask it about available search 
agents. DF provides them with service descriptions of all 
available and suitable search agents. User side agents can 
then check if then have suitable interfaces for available 
search agents and if so, they can contact them. The role of 
DF is presented in Figure 2. 


Figure 2: Directory Facilitator 


Agent authentication is also an important issue in a multi- 
agent system. We believe that a secure system should 
provide authentication [1,9]. Today, authentication 
mechahisms are standardised and widely used, so there is 
no need for development of new mechanisms. What we 
should do, when designing such a system, is implement an 
agent, that would enable automatic authentication by 
providing and checking digital keys and signatures to all 
agents in the system. In such way, agents on all layers of 
the system can check the identity of other agents. Last but 
not least, digital keys enable encryption of exchanged 
messages, which also increases the security in the system. 
Some cases of automatic verification and issuing of digital 
keys already exist [9,10], therefore their implementation is 
not a key issue for this system. 


B. Source layer agents 


As we have already mentioned, different information 
sources have different structure and query mechanisms. If 
we want to include many different information sources into 
a search system we must provide them with interfaces, 
which are on one hand specialized for each of the sources' 
structure, but on the other hand they offer a uniform access 
to the outside world. 

In our system, the functionality of these interfaces is 
presented in form of Source agents. Each of them can 
search its source (database) and provide information for 
user side agents in an uniform and understandable form. 


Da 
400 10” Mediterranean Electrotechnical Conference, MEleCon 2000, Vol. I 


Communication with other agents is performed by 
communication (KQML) modules, which are part of every 
Search agent. Communication is universal regardless of the 
structure and content of the information source. We are 
suggesting use of KQML language [12], which has a rich 
set of performatives that enable agent communication. 
Structure and functionality of source layer agents is 
presented in Figure 3. 

Source layer agents basically function in following way: 
When they are started they should get the authentication 
certificate, in order to be able to prove their identity to the 
rest of the system. This is only possible when the system 
enables authentication verification as discussed in chapter 
A. Furthermore, each of Source agents should register its 
functionality to so-called agent yellow pages at the 
intermediary layer, as described in chapter A and make 
itself known to the rest of the agent system. Then it can 
listen to search queries. When the Source agent receives a 
search query, it searches its database for related information 
(documents) and accompanying meta-data. It should not 
return only the found documents themselves (or their 
location), but also all meta-data about the documents. In 
such way, the user side agents get more information about 
the documents user is interested in, and can learn about 
user's habits. 


Source layer User system 
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Figure 3: Functionality of source layer agents 


We are aware of the fact that most of today's documents are 
not adequately described by meta-data and that today's 
information sources (search engines) are not returning nay 
meta-data at all. Suggested system enables inclusion of 
such information sources as well. Lack of meta-data is in 
this case ignored, but no personalization at the user side is 
possible, of course. Still, we believe that this philosophy 
should.be considered seriously and that information should 
be given adequate description(s). 


C. User side agents 


User side of the system uses the entire infrastructure that is 
provided by search and possibly other agents. In order to 
keep the modular concept of the system, interfaces for 
access to different search services should be used. In such 
way interoperability and use of a number of different search 
services can be guaranteed, without the need for change and 
adaptation of source code. These interfaces are 
communication modules, designed and adapted for 
communication with "its" Search agent, which is connected 
to one of the sources of information. All search interfaces 
are communicating with Search agents' communication 
modules, which must return search results to the main 
module in a universal format, regardless of source's 
structure or query language. As mentioned, suggested 
communication language is KQML and search results 
should be wrapped in them. 

Another important aspect of the user agent system is 
personalization. We can imagine the Search agent returning 
hundreds or thousands of search results, that is documents,- 
which correspond to users query. If these results 
(documents) are equipped with meta-data, we can order 
them in such way, that the documents, most suitable for 
user, are presented to him first. Furthermore, user profile(s) 
should be adapted, based on the information about user's 
areas of interest. This information can be obtained by 
tracking his interest through user profile (keywords used, 
documents opened and read, e-mail analyses...) [2,3]. 
Another benefit of this system's architecture is 
independence of personalization algorithms. No changes 
are needed if it is changed, upgraded or even left out. 
Graphic user interface represents user's access to the 
system. As we have mentioned it should enable tracking of 
user's interest and display of search results. There are many 
possible ways of implementation, either as a stand-alone 
application or through existing applications such as 
browsers for example [2,3]. 


An example of system implementation is described in next 
chapter. 


IV. IMPLEMENTATION OF SUGGESTED SYSTEM 


For the demonstration of described concept we have 
implemented a simplified version of suggested system. For 
the implementation we have used JAVA programming 
language. For agent communication we have used JATLite 
libraries product of the University of Stanford[11]. 

At the intermediary layer we have implemented the agent 
"yellow pages", represented by a Directory Facilitator 
agent. All service agents in the system have to register their 
capabilities at the Directory Facilitator. As we will see, in 
the demonstrational system the search service is represented 
by two Search agents. 

We have used two sources of information; one, containing 
links and meta-data about educational material and 
AltaVista search engine. The first was implemented as a 
relational database, accessed via SQL queries and the 
second. The other serves as an example of possible 
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inclusion of legacy applications into suggested agent 
system, but as such does not contain any descriptive 
information (meta-data) about the documents. Both sources 
are directly accessed by their Source agents, which are 
capable of performing source specific queries (SQL and 
HTTP-CGI), while offering uniform access to information 
at the same time. 
User side of the system is the most important part of the 
demonstrational system. It includes a number of agent 
modules. These are: 

e Main user agent 

e Search interfaces 

e Personalization module, with user profile(s) 

e Graphic User Interface (GUI) 


Main user agent integrates the work of all modules. It 
listens for user actions, coming through the GUI and passes 
search requests to search interfaces. When search interfaces 
return search results it passes them to personalization 
module, which performs personalization, based on user 
profiles. In the end it displays the results to user in a 
suitable way. 

Search interfaces are communication modules, designed 
and adapted for communication with "its" Search agent, 
which is connected to one of the sources of information. All 
search interfaces return search results to the main module in 
the same format, regardless of source's structure or query 
language. 

Search results are then be passed to the personalization 
module, which presents the results to user according to 
his/her preferences. Functioning of user agents of the 
demonstrational system is presented in Figure 4. 
Communication language used is KQML. 
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Interface 
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Figure 4: User agent model 


V. CONCLUSIONS 


A problem of finding the right information increases with 
the number of available documents. Their distribution 
among different information sources contributes to the 
complexity of search. Another problem is lack of document 


descriptions and it seems obvious that use of meta-data 
should become mandatory in the future. 

We have suggested a three-layer architecture of agent 
system, which enables inclusion of new information sources 
into the system through the mechanism of service 
registration and interfaced access to information sources. In 
the suggested system not only the document's locations 
(URLs) are returned, but also their meta-data, which 
enables personalized display of available information 
(documents) to the user. 

The suggested system was implemented in a simplified 
version, which contains the main agents and enables 
enhancements in terms of additional services and security. 
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Abstract—This paper describes the Cultural Journeys in the 
Information Society (CJIS) project. CJIS project is an INCO 
project (973324) funded by the ECC and its aims to address 
the problem of electronic information roads at various levels 
and.to develop a prototype system of such roads. The 
prototype will concentrate at first on cultural and historical 
information and the social relevance of such roads in 
education and training. This paper defines the notion of 
electronic information roads and explores the technical form 
which these roads can take. 


An electronic road is the user’s navigation path through a 
multimedia environment. More precisely, it is a series of links 
to the system’s Information Units (IU) the user chooses to 
follow through. [Us are the building blocks of the available 
information content and consist of the actual data (e.g. 
segment of video, image, sound or text) with an attached 
metadata index (semantic or system). The system produces a 
number of dynamic links that point the user to new 
Information Units. These dynamic links are based on the IU’s 
semantic nature and the user’s profile. 


Index Terms—Electronic Roads, Information Navigation, 
Search Engine, Meta-data 


I. INTRODUCTION 


Over the last few years there has been an explosion in the 
importance of research and development in multimedia 
systems and their deployment over the Internet and WWW. 
However, these systems usually have limited 
interconnections among them; they may be viewed as 
virtual territories with very few existing facilities that can 
help users explore and retrieve relevant information. They 
apply to discrete data spaces only, failing to provide users 
with a means of continuous exploration in the related 
information space that often spans multiple data spaces. As 
a result, uniformed users may be lost or retracted from their 
intended destinations if they can not describe a desired 
journey explicitly. Thus, there is a strong need to address 
this problem and to provide information guides that would 
allow a coherent exploration of information; an exploration 
that can be viewed as a structured journey in the available 
information. 


Search engines are used to gather a list of potential 
interesting sites in an effort to solve this problem. 


Additionally, human intervention for the update of 
interconnections is often employed. Both approaches have 
major disadvantages. The required time and effort for the 
manual update is both overwhelming and expensive while 
search engines often respond with a large set of sites that 
are only peripherally related to the actual query. A great 
amount of research is being invested towards the creation of 
more powerful and intelligent search engines. 


Electronic Roads can be thought of as the underlying fabric 
of the space of information and the Information Society at 
large. They shape this space and define a novel way of 
acquiring, using and exchanging information which 
ultimately can result in new social links between those who 
use these roads. 


An Electronic Road creates an information map with 
spatial, temporal and context or more generally semantic 
associations amongst different information elements. These 
electronic roads could be based on existing physical roads 
mirroring the information available in the physical area of 
the road or they could be completely thematic roads that 
traverse information that is semantically related but of 
different nature at different locations and time. Users who 
are interested in a specific topic would search for and take 
an appropriate Electronic Road. This will provide them with 
a traversal of a meaningful set of related information sites 
with each different user selecting a journey that suits their 
own interests and priorities in a dynamic way as they follow 
the road and discover new information that they judge 
useful. The user therefore is allowed or invited to explore 
the available information by dynamically selecting and 
forming a personalized information journey that suits their 
own needs and interests. 


A central challenging issue in the formation of these 
information roads is to define what indeed is the appropriate 
way to link, structure and organize information together. 
The available information needs to be structured in a way 
that supports flexible learners, rather than experts. For this 
reason every piece of available information in the system is 
associated with some metadata (semantic and system). 
Metadata can be maintained according to several standards 
such as the Dublin Core[9], XML[3] and RDF[6]. 


The primary requirement of an Electronic Road is to 
achieve a network of information in which the traveler can: 
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1. explore the information space in a natural way where 
the information offered remains continuously 
interesting (the system will provide dynamic links), 

2. adapt his/her exploration mode according to personal 
preferences both at the start and during the journey (the 
system will maintain a user profile), 

3. learn new information, particularly new relevant 
associations of her/his original domain of interest with 
other domains, 

4. offer as well as take information along his/her route, 
(the system will record his/her journeys), 


In order to address the above mentioned requirements a 
number of specific research problems need to be addressed: 


1. Develop adaptive and intelligent presentation 
multimedia systems. 

2. Build digital libraries for heterogeneous types of data 
and support sophisticated retrieval mechanisms [5, 2, 
13 

3. Examine the potential of state-of-the-art network 
technology (e.g. ATM, ISDN, new Internet 
architectures such as DiffServ ) and design reliable and 
effective networking algorithms [7]. Also refine search 
results based on current network and its state. 

4. Design and implement a metacomputing environment 
for amalgamating into one entity all the resources 
comprising an electronic road system. 


Il. CULTURAL JOURNEYS 


An electronic road is the user’s navigation path through a 
multimedia environment. It is a series of links to the 
system’s Information Units (IU) the user chooses to follow 
through. [Us are the building blocks of the system and 
consist of the actual data (e.g. segment of video, image, 


Cyprus Culture 


Independence Period British Period 


Fig. 1 Semantic Dictionary 


sound or text) with an attached metadata index (semantic 
and system). The system produces a number of dynamic 
links that point the user to new Information Units. These 
dynamic links are based on the IU semantic dictionary. This 
semantic dictionary has a hierarchical structure from 


general to specific. For instance, figure 1 shows a (small 
part of the) semantic dictionary for Cyprus Culture. 


A. CJIS Concepts 


We have the semantic dictionary (described above) and the 
digital library of the [Us holding the content. 


A secondary structure is that of an Information Element 
(IE) or NodeID which consists of a tuple of Information 
Units (IU) on the same subject. An IE is the total 
information /content in the different media that could be 
composed together to be presented to the user. 


As mentioned above, the building block of the system is the 
IU, which consists of the actual data (e.g. segment of video, 
image, sound or text) with an attached metadata index as 
depicted in the following table. 


Information Unit (seman 


TUID An identification of the IU Es 
Nodeld An identification of the information elements 

or nodes this can belong to 2 
Semantic The semantic dictionary nodes the IU 
Keywords belongs to. 


e.g. Cyprus Culture/Independence Period 
The group of users that might be interested in 
this IU. Users are classified by Age (12-18 _ 
and older than 18) and Depth of Interest 
(Low, Medium, High) ee 
Format i.e. text, image, video esas 
System Format __ It provides information about the format of 
resource which could be used to better utilise 
the networks resources. E.g. if the resource is 
a video segment it could provide the bit rate _ 


User Group 


Language the language the document is written e.g. 
English, Greek 

District Cyprus.Kerynia, Cyprus.Famagusta 

Period /Time e.g. pre-historic 


Other keywords _ e.g. Zenon 


The system also maintains a profile about the user’s initial 

preferences, records semantic metadata for every user’s 

stops and proposes a User Profile Record. The User Profiler 

consists of three main sub-modules: 

1. Initial User Profiler i.e. general information about the 
user and his/her preferences 

2. Recorded User Profiler. recorded user’s behavior i.e. 
User Previous stops. 

3. ProposeUserProfile. The system takes into 
consideration the information in 1 and 2 and returns a 
proposed User Profile record. 


The Presentation Unit (PU) is what the user sees. It 
presents an element of information (also called a node). 


Figure 2 shows a typical PU. This has the following general 
structure: 


a ____ 
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Static Links. These are links to the Semantic 

Dictionary of Context in a tree structure. 

Dynamic Links. The system produces a dynamic list 

of links according to user profile, and current IU 

metadata. 

e Primary links give extra information in the 
primary (if any) interests of the user (relatively to 
the current position on the semantic dictionary). 

e Lateral links change context to a “related” 
context. 

User Profile. The user can define (and redefine) 

his/her preferences and interests. 

Information Units: 

e Text. It contains textual content (of the current 
position’s node information element). 

e Image. It contains an image content (of the current 
position’s node information element). 

e Video. It contains a video (of the current position’s 


requirements and rearranges or modifies the list. As a 
result of the actions of System Refiner some 
information units might be omitted or have their rank 
changed. 

Link generator. This module takes as input the 
current’s position metadata and user’s profile and 
produces a list of related links to new nodes 
(information elements). These links could be either 
primary or lateral. 

System Based Refiner (Link). The output of Link 
generator is a list of links. This module considers the 
list of links, the bandwidth and QoS requirements and 
rearranges the ranking of the list. As a result of the 
actions of System Refiner some links might be omitted. 


6. PUs/Page Composer. This module takes as input the 


outputs of the System Refiner and Link Generator and 
produces the final Web document that the user will see. 
The system delivers a multi-frame document. The 


node information element). 


Also the user is able to search for information, save a 
journey or recall an old journey. 


rere tart 
SHEE 


Fig. 2, The Presentation Unit 


B. Electronic roads Functionality 


Every time the user chooses a dynamic link the following 
actions are taken by the system. 


1. User profiler. This module maintains a profile about 
the user from her/his initial preferences and records of 
the semantic metadata of the user’s previous stops and 
proposes a User Profile Record. The user’s profiler is 


based on the Proposal For An Open Profiling Standard 


[4]. 
2. Us Selector. This module takes as input the user’s 


next choice’s nodeJD and returns 3 ranked lists; one for 


each IU type. From these the system will select a 
subgroup in order to find the next presentation unit. 

3. System Based Refiner (IU). The delivery of 
information via a network needs to meet certain 
Quality of Service (QoS) requirements dictated by the 
user application [8].The System Refiner considers the 
lists of TUs, the available bandwidth and the QoS 
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document frameset is dynamic i.e. one or more 
Information Unit frames might be omitted or 
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Fig. 3 Electronic Roads State/Transitions 


C. A CJIS Scenario 


In general, the system produces and presents to the user 
links to Information Elements (IEs). The user chooses at 
each point a link and this forms a road. The journey in 
figure 4 is based on the Semantic Dictionary presented in 
figure 1. 


The roads, the system produces are initially based on the 
Semantic Dictionary (SD). Assuming, we are starting a 


journey from the root of the SD (i.e. Cyprus Culture) the 
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system will display information from the associated IE and 
links to other IEs whose Semantic Keywords (see metadata) 
are associated with that node of the SD. The primary. links 
to this point will be Independence Period and British 
Period. If the user selects the Independence Period road, 
then again the system will display the associated IE to this 
node. Primary links of this will be Architecture, Social and 
Political, and Lateral links will be British Period. Similarly, 
if the user chooses Political, then the new PU will have 
primary and lateral links as shown below. 


The above scenario was kept very simple, in order to 
demonstrate how primary and lateral links are produced and 
did not take into consideration other parameters that 
contribute to links dynamism, such as user profile and 
system metadata. 


Cyprus Culture 
Primary Links: 


Independence Period 
British Period 


Independence Period 
Primary Links: 
Architecture 


Lateral Links: 


Lateral Links: 
British Period 


Lateral Links: 
Architecture 
Social 


Fig. 4 A Cultural Journey 


Ml. IMPLEMENTATION ARCHITECTURE OF CJIS 


The software platforms that we will use are Java, 
JavaScript, Java RMI, JDBC, SQL DBMS, Dublin Core 
and hardware platform is Windows NT. 


1. User Interface Agent. It will run through a web 
browser and will consist of two frames: 

e Frame 1: Electronic Roads. This is a Java Applet 
that will get user’s input and interface with the 
main ER system that will be running on the server. 

e Frame 2: IU Content. This is the IU in an HTML 
format loaded from a URL address. 

2. Server Site. This is a Java Application. This is where 
the ER functionality is encapsulated. It is responsible to 
respond to the users. It also access the Database 
Management System. Use RMI and JDBC. 


3. IU and Metadata storage and retrieval. [Us will be 
stored as HTML documents. [Us metadata will be 
stored in an SQL DBMS (e.g. Microsoft Access) 
according to Dublin Core standard [9]. JDBC will be 
used to access the metadata from the main server Java 
application. 

4. Client Server Communication. Java RMI will used 
for the communication needs between the client and 
server. RMI is a simple and abstract but powerful 
enough for the communication needs for our system. 
Other communication alternatives are also investigated 
such as Mozart and MANIFOLD. 

5. Hardware Platforms. The client site is platform 
independent. The server Java application will be 
implemented and run on Microsoft Windows NTs. 

6. Network. We will consider an Internet and Internet 
architecture such as DiffServ, and ATM. 


ITV. CONCLUSIONS 


In conclusion, this paper describes the Cultural Journeys in 
the Information Society (CJIS) project. This project aims to 
address the problem of electronic information roads at 
various levels. The paper defines the notion of electronic 
information roads and explores the technical form which 
these roads can take. 
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The Notion of Subband Sampling 


A Proposition for the Construction of Extra High Value Bi-Directional Communication System 


Budi Rachmanto*, Ph.D. Student, Non Member, and Hibino Yasushi** , Member, IEEE 


Abstract 

Next generation of multimedia communication should give better 
sence of presence, as well as picture and voice quality. Or the other 
hand, transporting data streams with highest quality losslessly over 
a network is not an easy task. Data quantity is huge and difficult to 
handle. Realtimeness constraint of interactive communication only 
permits us a very short delay with no 'resend' policy. Yet, network 
resources is limited, and (lossless) realtime compression is still 
needed to meet with bandwidth constraints. 

This paper describes the outline of constructing communication 
system with extra high quality data streams, focused on subband 
sampling, a reviewed method for intraframe coding. 


Index Terms 
interframe coding, intraframe coding, subband sampling, 
lossless coding 


Introduction 

As an extension of the so-called High Definition Television 
(HDTV), the Extra High Quality CEHQ) communication 
system would likely provide much better quality, and should 
also give the possibility of realtime interactive 
communication, such as tele-conferencing, as contrasted with 
current HDTV which only provides one-way (broadcast) 
communication. On the other hand, from the point of view of 
network designers, transporting data streams with very high 
quality over a network (without losing any information 
contained) is not an easy task. The quantity of data will 
become very huge and very difficult to handle. Realtimeness 
constraint of interactive communication only permits a very 
short delay (at 150 ms one way) and does not permit any 
‘resend’ policy. Also, lossless transportation is highly 
preferred, so we need a kind of protection mechanism. Yet, 
network resources is limited, and (lossless) realtime 
compression is still needed to meet with bandwidth 
constraints. 


Video data is the component mostly characterizing an extra 
high quality stream. For CD quality audio, a bandwidth of 
44.2 kbps is considerably sufficient. Meanwhile, sending 
progressive scanned and uncompressed moving pictures with 
twice the height and width resolution of current TVs, 36 bpp, 
60 fps, will require minimal bandwidth of 20 Gbps. Being 
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compressed, however, it will introduce a highly fluctuating bit 
stream. Accordingly, to make a full and effective use of 
available transportation channel, it needs to consider how to 
send a data stream efficiently with no quality drop. 


We will first define the outline of constructing communication 
system with extra high quality data streams. Then, focused on 
the idea of subband sampling used for intraframe coding, we 
propose a codec construction to deal with stringent 
characteristics of extra high quality picture samples: 


Objectives and Concepts 

Our objective is to build a bi-directional communication 
system handling extra high quality data stream. The primary 
target is to send the huge data losslessly, at the same time 
efficiently, and the next is to reproduce pictures as nearly as 
the originals if severe troubles occurs on the transporting 
channel. The idea is that, we do not intend to compress the 
data as small as possible to be sent to the receiver(s). Instead, 
while assuming that transportation channel is limited and not 
perfect (i.e. vicious errors might occur), we purpose to deliver 
it without losing anything contained. 


ource . etwork eceiver 


Fig 1. System Diagram 


First, the stream itself might include audio and video data. We 
focus on the video data element, as it is the most dominant 
component. Each frame is coded by subsampler. For pictures 
with smooth degradation, components of higher frequencies 
are expected to have almost zero values, requiring fewer bits of 
encoding. Secondly, for the inter-frame coding, applying 
DPCM coding introduces shorter processing time and no 
frame delay. On the other side, frame flipping method implies 
longer processing time and frame delay. However, it gives 
more flexible options of inter-frame compensation. To put up 
with burst errors, we scatter the high priority frames (or 
subband components) within allowable period. 


Our basic concepts end here. However, a way should be 
considered to send data stream in case of network changes. 
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First, it needs to equip an error correcting mechanism, to 
overcome errors on the communication channel. Second, in 
case the network deteriorates in such the error correcting 
mechanism fails, subsampling gives the ability to reconstruct 
picture by composing the lower frequency elements. Third, ifa 
frame vanished, inter-frame compensation can be performed. 


Extra High Quality Picture 

As in [1]. we use specially scanned images with color depth of 
36 bpp and spatial resolution of 1600x1200 pp. as still picture 
data. Image night shows a picture of evening scenery. The 
other image is cooler, a snap photograph showing a table full 
of papaya, wine bottle and glass, and a trumpet. 


We use DV stream output from a digital video camera as 
sample data for moving pictures, because EHQ movie still 
does not exist presently. The camera interlacedly scans 60 
times a second, composing 30 frames. Color signals are only 
sampled once every 4 pixels. Inter-frame processing is not 
performed inside the camera. The size of each frame is 
720x480 pp., with color depth of 24 bpp. An uncompressed 
RGB frame thus needs approximately 1 MB to store. 


Encoder 


Fig 2. Encoder and Decoder 


RGB/YCrCb Conversion 
There are correlations between color signals, as described in 
[1]. Instead of processing RGB signals, we use 


illumination/color signal representation (as widely used in the 
world of television technology), because it is easier to handle if 
the signals are converted to the illumination/color signal 
representation. A picture with strong color correlation will 
have concentrated signal distribution, resulting in easier 
coding. 


System Structure 

The entire system construction consists of encoder, 
transportation channel, and decoder. Evaluation is done by 
realization of transportation channel simulator including 
network error parameters (bit error, cell loss, jitter). 


Inter-Frame Processing 

For a highly polluted DV stream, we equipped a provisional 
interframe noise filter. If in the future a camera with very low 
noise makes its debut, this filter will be unfiecessary. 
Considering the ability of frame compensation, frame flipping 
is preferable to DPCM, if additive processing time and frame 
delay are permitted. Also, time interval of initial frames can be 
adjusted, according to the network situation or the 
characteristics of the movie itself. 
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Fig 3. DPCM vs. Frame Flipping 


Fig 3 shows the effect of both DPCM and Frame Flipping.,. 
Frame flipping implies more fluctuated bitstream, however its 
average is not so different with that of DPCM. We concluded 
that if longer processing time and frame delay are permitted, 
frame flipping provides a better choice. 


Lossless Coding (Compressor) 

Here, no intentional information loss is permitted. 
Subfrequency streams flushed out by subsampler will be coded 
using a lossless compressor, which codes the stream not by 
utilizing the characteristics of the picture, as we have already 
used it at the pre-processors. 


Protector and Packetizer 

To cope with burst, bit errors or cell losses that might occur on 
the transportation channel, we equip the system with Error 
Correcting (Reed Solomon) Coder and bit interleaver. Data 
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stream is packetized and delivered over the network to the 
communication partner. Considering the characteristics of the 


transportation channel and switches, packet is chosen to be of 


a fixed length. This will reduce switching time. 


Intra-Frame Coding: Subband Sampling 

Subsampling is used as the intraframe coding method. Even in 
case that the higher frequency layers are damaged, the picture 
can be recomposed by using the lower frequency layers. 


Each picture is divided into several subfrequency componests. 
Except the lowest frequency component (which has the 
highest priority), output stream value of higher frequency 
samples are the differences between the real and the estimated 
values calculated based on lower frequency sampling values. 
Compression is done by eliminating redundancies of the 
sampled values. 


Fig 4. Zig-zag scan 


Instead of using classical zig-zag scan on a partitioned frame, 
we prefer two-dimensional scanning on sample prediction 
using Langrangian inter/extrapolation. 


P ce 
we 


predicted sample 


Fig 
5. Two dimensional scanning 


Sampling prediction is done by cross-scanning previously 
sampled data (i.e. lower frequency layers) around the point 
that needs to be estimated. It can be diagonally scanned, or 
vertically and horizontally, depends on the position of the 
estimating point. 


Fig 6. Sampling frequencies 


Figure 6 describes the frequency levels of subsampling. 
Number 0 (zero) means the place of dots associated with the 
lowest frequency (i.e. the most crucial frequency layer that 
must have the highest priority). Here number 6 indicates the 
places of the dots with the highest frequency number. 
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Fig 7. Sampling sequence 


Figure 7 describes the sampling sequence of subband 
sampling with maximum sampling step of 8. Complementary 
with layer segmentation described below, here we use a 
“fractal’-like sequence in order to make it more robust towards 
the transportation errors. 
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Fig 8. Segmentation of frequency layers 


A frame is decomposed into several sublayers (subfrequency 
or subband layers). Furthermore, each of them is then 
fragmented into one or more segment(s) before paralelly sent 
to the next (lower) processing layers. 
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Fig 9. Maximum differential value of each frequency 
element (cooler) 


Examining extra high quality image data brings a problematic 
result. The highest frequency component still contains 
significant differential values and cannot be ignored. The 
information quantity is still very big, and lossless compression 
on the highest frequency component does not give better result 
than about 50%. We can see here that, the sharper the picture. 
the bigger the discrepancy of estimated value. 


Table 1 Subsampling result on an extra high quality image 
(Cooler) 
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Table 1 shows (on its priority order) information quantity 
contained in each frequency component of a subsampled extra 
high quality picture. Namely, f0 is the lowest frequency 
component having highest priority, and f12 is the highest 
frequency component with lowest priority. Except the lowest 
frequency f0, value of each frequency sublayer is described by 
differential value between the real and the estimated value. 
The highest frequency component holds the most part of the 
information quantity. Reducing information quantity up to the 
quarter of the total does not imply a significant quality 


degradation However, although absolute differential value 
gets narrower at the highest frequency, it is unavoidable. 


Note that, in case of interframe coding, frames containing the 
most important data (I frames) have the biggest information 
quantity, while in case of subsampling (intraframe coding). 
the component containing the most important data (ie. the 
lowest frequency component) has only a few. Also, the priority 
of Y (illumination) signal has to be set higher than that of 
colo 


ae better appare quality. 


Fig 10. Subsampling result on EHQ picture (cooler), left: 
original, right: only 1/256 of color signals received. 


Realtimeness Concerns 

Practising a high quality realtimeness needs end-to-end delay 
to be kept below 150 ms. Assuming that transportation 
channel spans about 10000 km. propagation and switch delay 
can be assumed around 50 ms. Thus. the delay available for 
both encoder and decoder is only 6 frames period. By 
assuming encoder is twice slower than decoder, the total delay 
permitted on the encoder and decoder are only 4 and 2 frames 
period. Depend on the condition of the transportation channel. 
it needs to adaptively change the temporal or spatial 
resolution. Realizing this mechanism needs building a 
movement detector to be equipped into the system, deciding 
which parts are static and which are dynamic inside the 
stream. 


Summary 

Focused on subband sampling method used in intraframe 
coding, we introduced a definitive policy to constructing a 
bi-directional communication system to deal with extra high 
quality data streams. A simulation program built with 
Glade/Gtk is used for evaluation. 
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The PUMMAS) Interactive Environment: 
An Internet Service Coupling 
Web Technology and HPCN 

to Analyse Conformal Microstrip Arrays 


Andy Marsh, Dimitra I. Kaklamani, Member, IEEE, Hristos T. Anastassiu, Member, IEEE, and 
Phridon Shubitidze 


Abstract-- Using High Performance Computing (HPC) as a 
remote service over the Internet is now becoming a realistic 
possibility to investigate and visualise Computational 
Electromagnetics problems. Additionally, the pragmatics of 
their utilisation can be abstracted by adopting a World Wide 
Web (WWW) interface. In the present work we analyse the 
behaviour of conformal microstrip patch antennas for aircraft 
communications. Microstrip patch antennas consist of a thin 
dielectric slab (usually called substrate) covering a 
rectangular metallic plate-(ground plane). The radiating 
element of the antenna is a metallic patch located tangentially 
to the upper surface of the dielectric substrate, fed by an 
excitation implemented as a microstrip line or a coaxial cable. 
Due to their small size and weight, microstrip antennas, often 
clustered in arrays of several elements, are used in airborne 
communication systems. Since these arrays are usually 
positioned on the aircraft outer surface, they have to be 
conformal to the local portion of the latter. The way the 
composite antenna radiates is described by a _ three 
dimensional graph, called the far field pattern. The energy 
radiated to a given direction is proportional to the value of the 
graph in that particular direction. Protruding lobes of the 
graph correspond to high radiation directions. This paper 
concentrates on using a Web-based environment, not only for 
the connectivity, but also for the supportive tools of data 
entry, program initiation, result visualisation and even 
interactive modifications of the geometry and/or the 
electromagnetic properties of the examined problem. 


Index terms-- WEB, VRML, Conformal Arrays 


I. INTRODUCTION 


Nowadays, High Performance Computing (HPC) provides 
the potential to reduce the execution times needed to solve 
large size electromagnetic (EM) problems. In addition, the 


The authors are adjuncts to the Institute of Communications and 
Computer Systems, National Technical University of Athens, 9 Iroon 
Politechniou Street, GR-15780 Zografos, Athens, Greece, Tel.: +301 
7722277 ~=Fax: +301 7723557, E-mail: marsa@phgasos.ntua.gr; 
dkaklam@cc.ece.ntua.gr. 


growth of the World Wide Web (WWW) supports an 
interconnection network that, not only promotes the remote 
accessibility of the HPC platforms, but also provides a 
homogeneous environment for data entry and results 
visualisation. Electromagnetic parameters, related to an 
antenna problem, are often difficult to conceptualise due to 
their abstract nature, as well as their functional dependence 
on more than one variable. Therefore, efficient visualisation 
facilitates the fruitful processing and utilisation of the 
problem results. 


The goal of the Parallel Modified Method of Auxiliary 
Sources, P(MMAS), solver is to support an interactive 
Virtual Reality environment, whereby the user can define 
an EM problem, visualise the results and then interactively 
modify the geometry and/or the EM properties of the 
problem. The first step towards this goal is to utilise the 
WWwW for data entry, to initiate the EM solver and then use 
VRML to visualise the results. 


The first objective and work presented in this paper is to 
utilise web technologies to create an interactive integrated 
environment to solve the radiation problem of a patch 
antenna array, which is conformally printed on a cylindrical 
surface. In section II the P(MMAS) EM solver is 
demonstrated and in section III the format of the EM results 
is presented. Section IV shows the prototype P(MMAS) 
Interactive Environment and in section V the on-going and 
future work are discussed both in terms of the Interactive 
Environment and the EM solver. 


II. THE P(MMAS) EM SOLVER 


The geometry of the problem is depicted in Fig. 1. A 
metallic cylindrical surface bears an array of patch 
antennas, fed by microstrip lines or coaxial cables. Each 
antenna consists of a radiating element (conducting patch) 
located on top of a dielectric substrate. Similar structures 
are widely used in aircraft and spacecraft communications, 
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where the patch antennas are usually borne by the fuselage, 
wings, etc. 


: Metallic 
Microstrip cureice 
line feed 

Patch q : 
Dielectric 
substrate 


Fig. 1 : A patch antenna array conformal to a cylindrical 
surface 


To analyse the radiating behaviour of this conformal array, 
the Modified Method of Auxiliary Sources (MMAS) [1,2] 
is employed. Specifically, a set of radiating, fictitious 
current sources is set inside and outside the structure, on 
auxiliary surfaces, conformal to the actual ones. The values 
of the unknown weights of the source currents are 
calculated by imposing the appropriate boundary conditions 
at a discrete number of collocation points on the physical 
surfaces, and by solving the resulting linear system of 
equations. As opposed to the standard MAS [3]-[6], where 
the unknowns are the source elementary fields, the 
unknowns invoked in MMAS are the source currents. 
Additionally, all source points, as well as the collocation 
points, are placed on the nodes of canonical grids. These 
two choices enable the avoidance of high order singularities 
in the field expressions, thus allowing the simulation of 
very thin and open structures. 
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Fig. 2 : Grid and unknown quantities in modified MAS 


The basic idea of the described technique stems from the 
Finite Difference Time Domain (FDTD) method, since a 
similar grid scheme and derivative equations are applied to 
solve for the charge density, as shown in Fig. 2. Applying 


the proposed modified MAS and assuming an exp(-lat) 
time dependence, the tangential to the physical surface 
electric field is calculated in terms of the vector potential 
A and the scalar potential ©, as 


“be = I@A tan —(V@) tan (1) 
7,.ikR ikR 
ri Ho Je ' 1 pe ' 
‘ A=— ————as — ax ———ds 
A ats [ R 4m, I R @2) 


where S*” is the auxiliary surface. Moreover, by applying 


the current continuation law i@p = V-J, the charge density 
p can be expressed in terms of the J, and J, tangential 


current components, as 


lpi PL LOsEO ee (3) 
and the surface integrals in (2) are replaced by finite sums. 
A canonical grid, similar to the one shown in Fig. 2 is 
developed on the physical surface S and the gradient of the 
scalar potential @ is similarly calculated in a discreet way. 
Namely, if the indices (p,q) are assigned to a point on the 
physical surface S (corresponding to the auxiliary point 
(i,k) on S*”), the tangential components of the electric field 
on S are approximately equal to 


‘ (p-i.q) ~ Pep, 

Ey(p-12,9 =A yp, — ne ee (4a) 
, P(p,.q+1) — P@, 

Exip.qei2) = iA ypqe12) - 2 An a (4b) 


Therefore, since the V operator is not passed into the 
integrand, according to the present formulation, the 
singularities to be faced are limited to order R™ and, thus, 
numerical instabilities are avoided. 


A FORTRAN77 code has been developed for the numerical 
application of the technique described above, the efficiency 
of which has been enhanced through parallelisation [7]. 


Ill. EM RESULTS 


The results of the analysis include all characteristic antenna 
parameters, which may be single-valued, such as the input 
impedance, or functions of the co-ordinates, such as the 
antenna current and the radiation pattern. The latter shows 
the dependence of the antenna far field on the direction of 
observation. Since this direction is defined by two angles 
(elevation 6 and azimuth @), the resulting plot is three- 


dimensional, graphically casting a fictitious surface in 
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space. The radiation field is stronger in the surface regions 
lying away from the origin (lobes) and weaker when lying 
close to the origin (nulls). It is advantageous to the code 
user to visualise the far field pattern, when it is 
superimposed to the actual antenna geometry. In this way 
the user can have a global perception of how the 
geometrical features of the antenna affect the radiation 
pattern. Furthermore, the user may alter several input 
parameters interactively, watch the changes materialise on 
the screen, and finally examine the effects on the radiation 
pattern. 


IV. THE INTERACTIVE ENVIRONMNENT 


To facilitate the homogenous accessibility and abstracted 
usability of the P(MMAS) EM solver, an interactive 
environment, as depicted in Fig. 3, has been developed. 
Based on web technologies, the user from any compatible 
web browser can define the problem geometry (Fig. 3-(2)) 
and initiate the solver (Fig. 3-(3)). The solver has been 
optimised to execute on a parallel-computing platform (SGI 
ONYX with 256 Mbytes 2-Way interleaved main memory 
and 4 x 200 MHz IP19 processors; R4400 MIPS CPU and 
R4000 MIPS FPUI). On completion the server will inform 
the user (Fig. 1-(4)) the ftp location of the results. The user 
can, then, access and view (Fig. 3-(5)) the far field results 
via any compatible web browser. The user can view the 
currents, geometry or far field pattern, either as data files in 
ASCII or a 3-D representation in VRML 2.0 format. As 
shown in Fig. 3-(5) the geometry (LHS) and far field 
pattern (RHS), which can be gyrated independently, are 
combined onto a single screen. Each is presented in its own 
3D co-ordinate space with a pointer depicting an angle of 6 
and g. The user can manipulate the pointers by adjusting 
the slide bar values for @ and q. The user can also visualise 
the currents as a 3D representation, which are hyperlinked 
from the geometry. 


V. FUTURE WORK 


The rationale of this paper is to illustrate how parallel 
computing can be used as a computational tool in solving 
very large electromagnetic problems. This is especially the 
case when inherently parallel algorithms, such as MMAS 
are used. Additionally, the paper wishes to promote that this 
remote usage of computing resources is also being made 
possible by using the newly established high-speed 
telecommunication networks and web technologies to 
support a homogeneous interface. The objective of the 
authors’ current activities is to demonstrate the capabilities 
of the interaction between advanced visualisation 
techniques, including stereoscopic imaging and immersive 
VR, remote computing networking and advanced EM 
solvers. At present, the results in VRML format can be used 


to present a 3D far field model. The VRML 2.0 format 
could also be used to introduce a temporal dimension and 
WWW supported facilities, namely audio, could also be 
used to enhance the results presentation by using 
sonification. 
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Abstract—This paper presents two simulation methed af a flat 
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Numerical results are presented for particelar valees of the 
cartier frequency and mobile speed. 


Ingex Term:—monirequency selective Rayiesh fading, 
Tandem segmence Senerators, shaping filters 


L INTRODUCTION 


tramsmitied siemal through the channel due to the existence 
of multiple paths The ister ws related with ihe time 
Variions im the matere of the mmitpath Fach paih bas 
assocaated am aiiemaaiion factor and a propagation delay. 
wiuch are tome varamts. The Ravicigh fangs Game! 1s 
typacal for the heavily wha areas, when the LOS 
component is blocked and i 1s wilely accepted for the VHF 
and UHF chaamels. The received signal is the sum of a large 
numnber of phasors. Thea chanses m amplitude and phase 
ae random: comscqucaily, the received sigmal cam be 
modeled 2s a random process. For a lame mumber of paths. 
the contral lnmat theorem can be applied and the equivalent 
lowpass reccived sigmal cam be modeled as 2 complex 
valued Gaussian process. The envelope of the equivalent 
lowpass received signal has a Rayleigh pdf and the phase a 
waiform pdf ower the range 0 to 2x. 


Im thes paper @ 5S comsidered only the case of a non 
frequency scleciive and Rayleigh fadime mobik radio 
Channel it cowkd be considered that an unmodulated carrier 
iS Wansmatied becamse the propagation phenomenm: don't 
Gepead om the ratio of the symbol duration to the mukipath 
Spread of the Channel 


**Wireicss Commmmications Research Group. Department 
of Electromic Systems, University of Westminster _115 New 
Cavendish Street, London. WIM 8J3S_ UK. 


The equivalent lowpass diffuse component can be 
represented by emphasizing the attenuation factor and the 
propagation delay of cach peth component or the Doppler 
effect. as follows 


rét} = ya, (ne! im@® = UP: a ‘ia @D 
: = 


where a, () is the attemmation factor of the Ath component. 
r, (2) 1s the propagation delay of the Ath component. 

f. the camer frequency. g,(a complex valued random 
variable and ¢, =2z f_,t- 

The 4th component. coming at an azimuthal angie ¢, with 
respect to the direction of movement, has a Doppler shift- 
= 


¥ 

ican 
where crepresents the speed of light and v the vehicle 
speed. The complex envelope of the received signal can be 
also written m tems of the mphase and quadrafure 
components of the received bandpass signal as- 

rH) =7O+7O 
Zere Mean and variance equals to: 

var.) = var) =o" =R, O=4 
where 2q is the mean power of the received signal. 
The power spectrum of the received unmodulated signal, m 
Oe pa ee ee 

SWM= me 


eT) 
als>s 


The correlation function of the complex envelope is given 
by: 

R(A) = F [S.A = 20°, (@, AN = AR, (At) + JR, (AN) 
where ,,(_) is the first kind zero order Bessel function. 

The uncorrelation between the two components results m 
R_.(At)=0. VAr_ Consequently, the imphase and 
quadrature components are characterized by the same 


J, OSG, 


R, (A) =R, (A) =0°J,(@,A) 


416 8 =©——<“‘i—i‘i‘Ci:‘C:™C::””..___ 1 Minditerrancain Electrotechnical Conference, MB IeCon 2000, VoL 


Q-7803-6290-X/0/$ 10.00 ©2000 IEEE 


The received signal is completely characterized by the 
complex envelope and the carrier frequency. 

The equivalent lowpass received signal is the response of 
an equivalent lowpass channel to the equivalent lowpass 
transmitted signal. The time variant impulse response of the 
equivalent lowpass channel is given by: 


A(t, t) = Ya, (eS (t=7,(0) 
k=) 


The impulse response of a flat Rayleigh fading mobile radio 
channel can be considered as a single delta function whose 
weight has a Rayleigh pdf. This occurs because all the 
multipath components have negligible delay spread 
between them. We simulated the complex envelope of the 
received signal, which is equivalent with the simulation of 
the complex envelope of the impulse response of the flat 
Rayleigh fading channel. This can be done by generating 
the in-phase and quadrature components of the received 
signal, with given second order statistics, or the 
envelope p(t) and the phase (1) , which can be written as: 


p= r+? 


-(t) 
Q(t) = arct 1) 
are. “( (t) 


r 


II. SIMULATION METHODS OF THE FLAT RAYLEIGH FADING 
CHANNEL 


We present two simulation methods: one of them starts from 
the given correlation function, R,(Af), and the other one 
from the power spectral density function, S(/). 

The simulation scheme is presented in figure 1. It illustrates 
the method of generating the equivalent lowpasss received 
signal from the in-phase and quadrature components. The 
envelope and the phase can be computed, as shown in 
section I. The sequences applied at the inputs of the filters 
are Gaussian distributed and uncorrelated, with zero mean 


and variance o”. The sequences from the output of the 
filters are also Gaussian distributed, but correlated. 


Fig1.The simulation scheme 


The Simulation Method Starting From The Power Spectral 
Density Function 


When white noise with power o2 is passed through a 
stable linear filter with the system function H(q@), the 


power spectral density function of the resulting process is 
given by: 

S,(@) = o2|H(@)| 
The filters form the signals accordingly with their 
Doppler power spectrum. It can be noticed that the shape 


of the power spectrum depends only upon the magnitude 
of the filter frequency response. 


A. The Simulation Method Starting From The 
Correlation Function 


Starting from the correlation function the filter 
coefficients can be computed using the modified Yule- 
Walker equation [6]. In the AR case, the Yule-Walker 
equations become the normal equations, which can be 
solved with the Levinson-Durbin algorithm. The key of 
this algorithm is the recursive computation of the filter 
coefficients, beginning with the first order and increasing 
the order recursively, using the lower order solutions to 
obtain the solution to the next higher order. 


III. THE RANDOM SEQUENCE GENERATOR 


A correlated sequence is obtained from an uncorrelated one. 
Consequently, the better the performances of the 
uncorrelated sequence generator the better the performances 
of the correlated sequence generator. For each generator it 
is necessary to test the periodicity of the generated 
sequence, the correlation between the sequence samples, 
the pdf of the sequence and the parameters of the pdf. The 
non-parametric tests, which refer to the distribution, are 
split into two categories: empirical tests and theoretical 
tests. The empirical tests use ordinary statistics in data 
processing. The result is compared with a tabulate value 
and from this comparison result a conclusion about the 
performances of the generator. Different generations can 
produce different results. It is necessary to compute an 
average of the results from different generations. If the 
result obtained from one of the generations rejects the 
hypothesis of a given pdf, the generator is considered 
“suspect” One of the most widely used empirical tests, 
applied to the discrete distributions, is the y’ test.[9] 

The Gaussian distributed sequence from the input of the 
filter is generated starting from a uniform distributed 
sequence. We used the uniform distributed sequence 
generator within Matlab programming medium. 

For a number of 10,000 samples and 25 generations, the 
computed average from the results of the y’ test is 5.7, 
close to the tabulate value for a 25% trust level, which is 
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equal to 5.899. The conclusion is that the performances of a 
Rayleigh fading channel simulator are better if the uniform 
distributed sequence generator, on which is based the 
Gaussian distributed sequence generator, is more 
performable. 


0 100 200 300 400 500 


mz 
IV. RESULTS Fig2.The magnitude response degree 
A flat Rayleigh fading channel is simulated using the two degree 


methods presented in section II. The input data are: the 

carrier frequency equals to 1.5GHz, the vehicle speed of 
80Km/h and the variance equals with unity. In the first case, © 
we consider digital IIR filters and in the second, AR(p) 

models. 


A. The Simulation Method Starting From The Power 
Spectral Density Function 


The IIR filters are designed using the indirect method. The if es 
design of the analog prototype filter is made with the pag <i 
Butterworth, Chebyshev type I and II and elliptic DE. Ga ot an 
approximations. The filter order is computed from the i an 
condition of a minimal order that fulfills the imposed 
specifications. The minimal order obtained with the 
Butterworth approximation is 12, with the Chebyshev j ie 
approximation is 5 and with the elliptic approximation is 3. 4 ee ee 

A lower value of the filter order results in fewer elements 415 1 05 0 OS 1 
within the internal structure of the filter. The paid price is a 
larger distortion of the group delay. The transformation of 
the analog filter transfer function is performed by the 
bilinear transform. This method assures a_ steeper 
attenuation characteristic comparing with the impulse 3000 

invariance method, but the paid price is a larger distortion 2500 J 
of the group delay. In this application of the digital filters 
only the magnitude of the transfer function is important, not 
the phase. The stability of the digital filters is checked using hay 
the zero-pole analysis. 1000 


Imaginary part 
& 
om 
x xxx 


Figures 2 - 8 refer to the case of the Butterworth i 
approximation. Similar results were obtained for the 

another approximations. Figure 2 presents the magnitude Fig5.The histogram of the output filter sequence 
response versus frequency, figure 3 the phase response 

versus frequency, figure 4 the zero-pole diagram, figures 5 

and 6 the histograms of the sequences from the output of a aC) Wee es 

the filters, figure 7 the histogram of the envelope sequence alk. 
and figure 8 the histogram of the phase sequence. It can be 2000 
noticed from the histograms of the sequences from the 
output of the filters that the mean is equal to zero, which is 
the desired value. 


Fig6.The histogram of the output filter sequence 
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Fig8.The histogram of the phase sequence 


B. The Simulation Method Starting From The 
Correlation Function 


The AR simulator is composed by two different parts. The 
former consists in the estimation of the model parameters 
and the later in the generation of the complex valued 
gaussian process, with a given correlation function. The 
number of the correlation function samples equals with the 
order of the filter plus one. The higher the order the better 
the approximation, but this determines a larger amount of 
calculation. We present the results for a sampling period of 
10ms and a filter order of 10. The stability of the filter is 
determined with the pole-zero diagram. If the filter order is 
acceptably increased, the filter is still stable. The 
distributions of the envelope and phase were tested with the 
histogram method. The theoretical correlation function of 
the in-phase component is compared with the estimated 
correlation function, computed on the expression: 


rl = P 
R= Fa eli +e 


The two functions are represented in figure 9. Similar result 
is obtained for the quadrature component. The model is 
suitable for a number of sample equals to the filter order 
plus one. The period of simulation increases with the filter 


order. The y’ test can be also applied to a Rayleigh 
distributed sequence. However, in this case, the sequence is 
correlated, which implies the application of the test only to 
the uncorrelated samples. 


*estimate 
+ teorethic 


ample 
umber 


Fig9.The correlation function for the in-phase component 
sequence 


V. CONCLUSIONS 


The simulation of a flat Rayleigh fading mobile radio 
channel consists in the generation of the complex envelope 
of the received signal when an unmodulated carrier is 
transmitted. The performances of the uncorrelated Gaussian 
distributed random sequence generator and the design of the 
filters determine the performances of the flat fading mobile 
radio channel simulator. 
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Maximum periodic correlation of pseudo-random 
sequences in CDMA 


Mitja Stular, Student Member, IEEE, and Saso Tomazic, Member, IEEE 


Abstract— The article presents the comparison of the 
Welch, modified Welch, Sarwate and Sidelnikov lower 
bounds on the correlation of CDMA codes. It is shown 
that the modified Welch bound is the best general bound 
known, while for codes whose symbols are roots of unity, 
the Sidelnikov bound is tighter. It appears likely that these 
bounds are nonoptimal for larger number of codes in a code 
set, and that there is still place for further improvements. 


Keywords— CDMA, code set, 
bound 


correlation, theoretical 


I. INTRODUCTION 


N code division multiple access (CDMA) systems, 

pseudo-random. sequences, i.e. codes, are used in or- 
der to distinguish between signals of different users. For 
an ideal CDMA system, two requirements should be met: 
to prevent signals of different users to interfere with each 
other, codes should be of zero crosscorrelation, and, for 
successful synchronisation to the received signal, the auto- 
correlation of each code should be pulse-like with nonzero 
value at the origin and zero value otherwise. It is proven 
that both requirements can not be met simultaneously. We 
briefly discuss this in Section II. 

If an ideal code set does not exist, it is important to know 
what compromises are feasible. Sidelnikov, Welch and Sar- 
wate have analytically derived theoretical lower bounds on 
how small the crosscorrelation and autocorrelation can si- 
multaneously be. The results are described in Section III. 

In Section IV, we compare the bounds in several aspects. 
The results are instructive, and give us some ideas for fu- 
ture work. 


II. IDEAL CODE SET SELECTION FOR CDMA 


In this section, we discuss correlation properties of an 
ideal code set for a CDMA system. 

Consider a CDMA system with K users and a be- 
longing code set of K codes (one per user) c;[n], where 
7 = 1,2,...,K. Codes are periodic with a period N. We 
choose codes with the normalised power: 


Baa 2 lee? =1 (1) 


For an ideal CDMA system, two requirements should be 
met: 
1. To prevent signals of different users from interfering with 
each other, codes should be uncorrelated. This means that 
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the crosscorrelation function of every single pair of codes 
should be 0. 

Let r;,;[7] be a time-discrete periodic correlation function 
of codes c;{n] and c;[n]: 


N-1 
se 8 S> ef nlej[n +7] (2) 


n=0 


Equation (2) is the autocorrelation function when 7 = j, 
and the crosscorrelation function when 7 4 j. Note that 
a crosscorrelation function of periodic codes is periodic as 
well. 

As mentioned above, the crosscorrelation function in (2) 
should equal 0 for all pairs of codes and all values of Tr. We 
obtain: 


iF J, 
2. For successful synchronisation to the received signal, the 
autocorrelation functions of codes should be pulse-like with 


a zero out-of-phase value. Inserting this requirement into 
(2), we get the following identity: 


ri,j[T] = 0; i,j =1,2,...,K (3) 


70 
rell={ 6 O<7r<N-1 (4) 


which should be true for all codes 7 = 1,2,...,K. The 
system that fulfils this requirement is self-synchronising. 

It is easy to see that both requirements can not be met 
simultaneously. To prove this, let us consider them in the 
frequency domain. Let R;,;[k] be a discrete Fourier trans- 
form of a periodic crosscorrelation function: 


1 N-1 iS 
Rislk] = 3 D7 rialrle 9" (5) 


4°) 


which gives us the following Fourier pair: 
riglt] —> Ri,g[k] = CF[R] - C5[R] (6) 


where * denotes the complex conjugation. Applying this 
to (3), we obtain: 


Ci[k]-Cj[k]=0; #439, 4,7=1,2,.,K (2) 


Equation (7) shows that two codes are uncorrelated only 

if their frequency spectra are nonoverlapping. So, at each 

value of k, at least one spectrum should equal 0. 
Similarly, from (4), we obtain: 


Ri ilk] = Ci [k]C;[k] 
= |C;[k]|/? = 4 i=1,2,.. 
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In order to have a pulse-like autocorrelation function, the 
amplitude spectrum of each code should be constant, that 
is white, and, not equal zero. This is in contradiction to 
equation (7). The conclusion is that an ideal code set, 
defined through requirements (3) and (4), does not exist. 
Every code selection is a compromise of crosscorrelation 
and autocorrelation functions. 


III. THEORETICAL BOUNDS ON CORRELATION OF CODES 


Many authors worked on the problem of theoretical lower 
bounds on the correlation of codes. Most significant are 
the results of Sidelnikov [4], Welch [2] and Sarwate [3]. 
They derived several theoretical bounds, i.e. the Welch 
bound, modified Welch bound, Sarwate bound and Sidel- 
nikov bound, which are discussed in this section. 

A widely used criterion for a code set evaluation is the 
absolute value of the periodic correlation of codes [1]. The 
above-mentioned authors used the same criterion. 

For a given code set of K codes, let re be the maximal 
periodic crosscorrelation magnitude 


iF]; re) a Pe2si cn, (9) 
0<T<N-1} 


To = maz{|ri,;[T]|; 


r4 the maximal periodic out-of-phase autocorrelation mag- 
nitude 


a= mariir.s|T is t= 1,2, AK, 


1<tT<N-1} (10) 
and ry the maximal correlation magnitude 
rm = maz{ro,ra} (11) 


where ro and ra are from (9) and (10), respectively. 


A. Welch bound 


In [2], Welch derived the lower bound of the maximal 
correlation magnitude ray. The derivation, which is based 
on the expression for the even correlation moments, is valid 
for all codes of fixed norm (1). The result is: 


1 KN 
he lees ot NOES EE A ee) 
'M = EN D1 feed 4 ) 
s 


where s is a positive integer, K the number of codes, and 
N the period of codes. 

Relation (12) is shown in Figure 1. Different curves are 
for different values of s. According to (12), ray lies above 
or on the highest curve for a particular K. Consequently, 
the resultant Welch bound is compound of the highest seg- 
ments of the curves in Figure 1. The resultant Welch bound 
is given in Figure 2. Two remarks on the Welch bound in 
Figure 2 should be given: 

e If the number of codes K increases, maximal correla- 
tion ry increases as well, i.e. larger the number of users, 
stronger the mutual interference. 

e Thecurve of the Welch bound is not smooth, which seems 
unnatural. We could assume that the Welch bound is not 
the tightest theoretical bound possible. 
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logy K 
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Fig. 1. Relation (12) for different values of s (NV = 128) 


7 . N=128 


Fig. 2. Welch lower bound (NV = 128) 


B. Modified Welch bound 


With a slight modification of the Welch derivation in [2], 
the modified Welch bound can be obtained. The result i 
[3]: 


1S 


KN 
(N — 1)r28 + N(K — 1)r28 > =i 


where N is a period, K the number of codes, and s a posi- 
tive integer. The relation is valid for all codes of fixed norm. 
Inserting ry > rc and ry > ra into (13), we obtain the 
Welch relation in (12). Therefore, the Welch bound is only 
a special case of the modified Welch bound. 

The modified Welch unequality describes the relation be- . 
tween maximal crosscorreleation magnitude rg and maxi- 
mal out-of-phase autocorrelation magnitude r4, having K 
and N as parameters. In order to obtain the modified 
Welch bound for particular K and N, we have to calcu- 
late the rg - r4 curves for all s-es which obey the modified 
Welch relation, and then create the maximal curve from 
the segments with the largest rc values. The situation 
for N = 128 and K = N‘% is shown in Figure 3. Only 
s = {1,...,6} obey the modified Welch relation, while the 
highest segments are from curves with s = {4, 5,6}. 

Note that for larger values of K, larger s-es come into 
account, which is sketched in Figure 4, where the curves 
for K = {N?/2,2N,N/8} are shown. For example, at 
K = N?/2, the modified Welch bound is made up from 
three different segments s = {1,2,3}. At K = 2N two 
values of s = {1,2} come into account, while at K = N/8 


(13) 
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Ic N=128 , K=N4* 
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Fig. 3. Modified Welch relation for K = N 4 and N = 128 


TA 
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Fig. 4. Modified Welch relation for different K (N = 128) 


only s = 1 is relevant. 
Finally, the modified Welch bound is shown in Figure 5. 
Note again that the curves are strongly segmented. 


N=128 


Tc 
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Fig. 5. Modified Welch bound (N = 128) 


C. Sarwate bound 
In [3] Sarwate derived the following relation: 


N-1 


Niet = i 


rel (14) 
which is valid for all codes of fixed norm. It can be easily 
proven that this relation equals to (13) at s = 1, which 
is the only relevant. value of s for K < N. Therefore, the 


Sarwate bound is just a special case of the modified Welch 
bound. It is believed to be the best theoretical bound pos- 
sible for K < N [3]. 


D. Sidelnikov bound 


In [4] Sidelnikov derived relations, which are valid for 
codes, whose symbols are q-th roots of unity. The Sidel- 
nikov relation for g = 2, i.e. binary codes, is: 


1 s(s+1 28 Nea 
ra W2 (cis ay nine eee 


2 hit gee ( a ) 


(15) 
where K is the number of codes, N a period, and s an 


integer number 
2N 


0O<s< sa) (16) 
For q > 2, i.e. nonbinary codes, the relation is: 
1 |s+l1 25 Ne 
Te, = We a (AN — S) = (17) 


K(s!)? ( a ) 


where K is the number of codes, N a period, s an integer 
number s > 0. 
The relation (15) is shown in Figure 6. We obtain a 


™ 

0.35 = 

0.3 = 
0.25 = 

0.2 pa 
0.15 s=l 

0.1 us 
0.05 

logy K 
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Fig. 6. Relation (15) for different values of s (N = 128) 


family of curves for different s-es. The binary Sidelnikov 
bound is compound of the highest segments of the curves. 

Sidelnikov bounds for binary and nonbinary codes are 
shown in Figure 8. Note that, for K > N/2 (can be easily 
obtained from (15) and (17)), nonbinary codes demonstrate 
much lower correlation. The reason, why binary codes (e.g. 
the Gold, Kasami, Walsh-Hadamard codes) are so popular, 
is not in better performances, but in their simple usage. 
Note that the Sidelnikov curves are segmented. 


IV. COMPARISON OF THEORETICAL BOUNDS 


We described four different theoretical lower bounds on 
correlation of codes. In this section, we compare these 
bounds and discuss their limitations. 

We already know that the Welch, modified Welch and 
Sarwate bounds are general bounds for codes of fixed norm. 


AA en nL OEE _ EE ee 
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On the other hand, Sidelnikov bound is restricted to codes 
whose symbols are roots of unity. Let us first discuss the 
general bounds, and then compare them to the Sidelnikov 
bound. 


A. Comparison of general bounds 


We point out two important facts, which follow from the 
descriptions in the previous section: 
e The Sarwate bound is just a special case of the modi- 
fied Welch bound. Putting s = 1 into the modified Welch 
relation (13), we obtain exactly the Sarwate relation (14). 
e The Welch bound is also a special case of the modified 
Welch bound. A drawback of the Welch relation is that 
it does not differentiate between crosscorrelation r¢ and 
autocorrelation r4. Thus, the Welch bound gives us much 
less information about the correlation behaviour of codes. 
The example is shown in re - rg graph in Figure 7, where 
the Welch bound is presented as a square of a side ry. 
The upper right corner lies exactly on the modified Welch 


Tc N=128 


0.08 modified Welch bound 


yk 
0.06 
0.04 <— Welch bound 
0.02 


TA 
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Fig. 7. Welch and modified Welch bounds at K = N (N = 128) 


curve. 

In short, the modified Welch bound is the most general 
lower bound known. It provides an insight into ro - ra 
behaviour of the correlation of codes. 


B. Comparison of Welch and Sidelnikov bounds 


Comparing the Welch and Sidelnikov bounds, we can 
figure out the differences between general codes of fixed 
norm and codes, whose symbols are roots of unity. The 
bounds are shown in Figure 8. We see the following: 


 Sidelnikov 
bound (q>2) 


logy K 


Fig. 8. Welch and Sidelnikov bounds (N = 128) 
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e General codes of fixed norm demonstrate the lowest cor- 
relation values. Every further restriction on the symbols of 
codes causes an increase of correlation values. 

e The worst from the correlation perspective are binary 
codes. Thanks to their simple implementation in a CDMA 
system, they are still widely used. 


C. Example 


Let us consider an example of K = N?/2 binary codes, 
N = 128, in an ro - rg graph. Theoretical bounds for 
this case are shown in Figure 9. The resultant theoretical 
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Fig. 9. Resultant bound for K = N?/2 (N = 128) 


bound (dotted curve) is compound of the Sidelnikov bound 
for binary codes and modified Welch bound. The curve is 
segmented. It is very likely that this is not the tightest 
theoretical bound possible. 


V. CONCLUSIONS 


The comparison showed that, for general codes, the mod- 
ified Welch bound is the best theoretical bound known. 
For codes whose symbols are roots of unity, the Sidelnikov 
bound is much tighter. A drawback of the Sidelnikov re- 
lations is that they do not differentiate between crosscor- 
relation rg and autocorrelation r4, but only express the 
largest of them as ry. Since rc - ra representation of 
the correlation properties of codes is very instructive, we 
propose a modification of the Sidelnikov derivation. 

In addition, all the theoretical bounds discussed in the 
article have strongly segmented courses for larger K. This 
is very unnatural, and, it appears likely that the bounds 
can be further improved. This is the topic that deserves to 
be examined in the future. 
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Transmission Power Management and Control 
in Third-Generation W-CDMA-Based Cellular 
Systems 


P Demestichas, A.Igoumenidis, G. Kotsakis, V.Demesticha, E.Tzifa, M.Anagnostou, M.Theologou 


Abstract-The adoption of W-CDMA as an essential 
component of the air-interface of third generation cellular 
systems brings in the foreground the need for new planning 
methodologies and software tools. In this perspective, this 
paper addresses planning problems that are important to the 
dimensioning of W-CDMA-based cellular networks. The 
problems aim at finding the optimal feasible allocation of 
transmission power to the sets of uplink and downlink 
connections that should be supported by the system, so as to 
cope with a corresponding traffic load scenario. In this paper 
we focus on the uplink problem. Specifically this problem is 
concisely defined, mathematically formulated and solved by 
means of a computationally efficient, novel algorithm. The 
solutions of the problem will be exploited by management and 
control schemes in order to improve the system performance. 


Index Terms-- UMTS, QoS, Planning tools 


I. INTRODUCTION 


Third-Generation mobile systems such as the Universal 
Telecommunications System (UMTS)[1,2,3,4], (IMT-2000) 
[5] promised to offer a variety of highly sophisticated 
communications services to mobile users [6,7]. Their 
commercial success will depend on the cost-effective 
provision of these services at Quality of Service (QoS) 
levels that are comparable to those offered by fixed 
systems. The support of these services is coupled with the 
efficient utilisation of the air-interface. 

In the beginning of the 1998, the Special Mobile Group 
(SMG) of ETSI decided that W-CDMA would be a key 
component of the UMTS air-interface, which is called 
UMTS Terrestrial Radio Access (UTRA) 
[8,9,10,11,12,13,14,15]. 
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The adoption of W-CDMA brings in the foreground th 
need for new methods and software tools that will enab! 
cellular system operators to properly dimension their ai 
interface [7,8,9,10,11]. 

Successfully dimensioning the air-interface of a cellulz 
system may be interpreted as to enable support of a set « 
connection demand vectors. Each connection deman 
vector is related to a traffic load situation that the syster 
will have to handle. It provides the maximum number 
connections per service and portion of the service area thé 
the system should be capable of simultaneously supportin; 
so as to cope with the offered traffic. In W-CDMA system 
the main factor that may prevent the accommodation of 
connection demand vector, and therefore, may limit th 
system capacity, is the lack of a feasible allocation 
transmission power to the connections. Therefore, a 
important problem that should be solved in the context c 
dimensioning the air-interface of a W-CDMA is th 
following: “Given the service area layout, elements of th 
system architecture and characteristics, namely, the ce 
coverage and the equipment capabilities, the propagatio 
conditions in the service area, the characteristics and Qo 
requirements of the offered services, and a service deman 
pattern (connection demand vector deriving from the uss 
behaviour and distribution models and the servic 
characteristics), find an optimal feasible allocation 
transmission power to each uplink and downlink connectio 
designated by the service demand pattern”. 

The above problem is decomposed into sub-problems. Th 
first is called Aggregate Allocation of Transmission Powe 
—Uplink (AATP-U) and aims at finding an allocation « 
transmission power to the uplink connections that th 
system should be capable of simultaneously supportins 
according to the service demand pattern. The secon 
problem is called Aggregate Allocation of Transmissio 
Power-Downlink(AATP-D) and aims at finding a 
allocation of transmission power to the downlin 
connections. 

The problems outlined above are related to previous wor 
on transmission management and control [16,17,18]. The 
solutions will be used by management and control scheme 
in order to improve system performance [19,20,21]. Th 
rest of this paper is organised as follows. The AATP-I 
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problem is described in more detail in section II. The 
AATP-D problem is omitted for brevity. Section Ii 
includes the optimal formulation and the solution to the 
problem. Finally, section IV includes concluding remarks. 


II. FORMAL DEFINITIONS AND PROBLEM STATEMENT 


This section describes in more detail the AATP-U problem. 
Our first assumption is that the service area layout is 
represented by a set of pixels denoted as P. The set of cells 
in the system is denoted as V . Function pix(y) provides the 
pixels that belong to cell y (veV) and function ¢(p) 
provides the cell in which pixel p (pep) belongs. 
Function /(y) provides the location of the BTS of cell v 
(v eV ), i.e., the pixel of pix(y) in which the BTS is located. 
The propagation conditions in the service area may be 
described by a function A( Pq) (( p,q) e P?). This function 
provides the average lognormal attenuation of a 
transmission that originates from pixel p and terminates at 
pixel g. For example, || ple »))| (peP) denotes the 
attenuation of an uplink transmission that originates from 
pixel p and terminates at the BTS site of the cell in which 
the pixel belongs. This may equivalently be expressed as 
A(p,l(v)) (p €P, v eV ) provided that p € pix(v). 

The set of services that are offered by the system is denoted 
as S. The minimum acceptable signal-to-interference ratios 
that should be provided to the uplink connections of service 
s (s eS) are denoted as SIR (s). 

The network element-capabilities provide the maximum 
power at which mobile terminals (stations) may transmit. 
These measures are denoted as p™*(s) (s eS). The notation 
implies that terminal classes with different capabilities are 
considered. The classification is based on the service (or set 
of services) that may be accessed through the terminal. 

The connection demand vector of the service demand 


~ 


pattern may be denoted as fV, = {n,(,5) (p,s) €Px s}. 


More specifically, the sets f, describe the spatial 
distribution of the connections that the system should be 
capable of simultaneously supporting, so as to cope with a 
corresponding traffic load situation. Hence, n,(p,s) denotes 
the number of uplink connections of service s that 
originate from pixel p. The allocation that is sought by the 


AATP-U problem may be formally described as P = 


{B(p,5) 


P(p,s), denotes the transmission power that should be used 


(p,s) Px s}. Each element of the allocation, 


by an uplink connection of service s that originates from 
pixel p (and terminates at the BTS of cell ¢(p)). The 


allocation should minimise a cost function ct PB ), which is 


associated with the aggregate transmission power that is 
allocated to the uplink connections. 

The constraint of preserving the signal-to-interference ratio 
‘of each connection above the minimum acceptable 


threshold may be expressed as Q.(P,,p,s)> SIR,(s) for the 


uplink. Function @Q(%,p,s) provides the  signal-to- 

interference ratio that will be experienced, as an outcome of 

the allocation ? , by the uplink connections of service S$ 

that originate from pixel p. The constraint of assigning 

transmission powers that are compatible with the equipment 

capabilities may be expressed as Pp ( DP; s)< pr=(s). The 

discussion above leads us to the following formal statement 

of the AATP-U problem. 

Problem : [Aggregate Allocation of Transmission Power on 

the Uplink — AATP-U]. 

Given: 

1. the set of pixels P that represent the service area 
layout, 

2. the set of cells VY , a function pix(y) that provides the 
set of pixels that belong to cell v eV, a function ¢(p) 
that provides the cell in which pixel p belongs, and a 


function /(y) that provides the location (pixel) of the 
BTS of cell veV, 

3. the propagation conditions in the service area, 
expressed by the function 4(p,q) ((p,g)eP?) that 
provides the attenuation of transmissions that originate 
from pixel p and terminate at pixel g, 

4. the set of services offered by the system § and for each 
seéS the minimum signal-to-interference ratio that 
may be tolerated by an uplink connection of the 
service, SIR (s) , 

5. the network element capabilities, i.e., the maximum 
power at which the various mobile terminal classes 


may transmit, p™*(s) (s eS), 

6. the service demand pattern, i, a _ vector 
Nie {n,(p,5) (p,5) e Px s} providing the number of 
uplink connections that should be supported on a per 


service and pixel basis, 
find an allocation of transmission power to the connections 


of V,, B= {P(p,5) (p,s)ePxs}, that minimises a cost 
function C(®), which is associated with the aggregate 
allocated power, subject to the constraints 
o(B, p,s)2 SIR,(s), i. of guaranteeing the SIR 
requirement of each uplink connection, and 
P(p,s)< p™*(s), ie. of being compatible with the 
equipment capabilities. 


III. FORMULATION AND SOLUTION OF THE AATP-U 
PROBLEM 
This section starts from an optimal formulation of the 
AATP-U problem. In the sequel, there is an elaboration on 
the optimal formulation, which leads to the computationally 
efficient solutions that is presented in the last sub-section. 


A. Mathematical Formulation 


The AATP-U problem may be mathematically formulated 
as follows: 
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Meyers C(B)= DY n(p,5):2(ps) 
Subject to, 
0, (B,, p,s)> SIR,(s) 
P(p,s)- A(p,(»))  SIR,(s) 
Py (9.5): B.(q,5)- A(q.M(v)) + 1,,(v) + NW 
v(v, 2,5) €(V x pix(») x 8) (2) 
P(p,s)< V(p,s) «(Px S) (3) 
Teal) = e he P(g,3)-A(g.v)) Wve” (4) 
aq P-pix(v)) F<S 
B= {P(p,s)| (p,s) Pxs} (5) 


Cost function (1) provides the aggregate power levels that 
will be transmitted by the mobile terminals of the system. 
The set of constraints (2) ensures that the signal-to- 
interference ratio that will be experienced by each uplink 
connection, as an outcome of the ? allocation, will exceed 


the minimum acceptable threshold. The SIR‘ (s) value is 
related to the S/R (s) value through the relation s7R'(s) = 


SIR,(s) /( rs SIR,(s)) (the proof is omitted for brevity). N, is 
the noise spectral density and W is the bandwidth available 
in the cell. The set of constraints (3) ensures that the power 
at which the mobile terminals will be asked to transmit is 
compatible with their capabilities. The set of relations (4) 
provides the external interference sensed at the BTS site of 
each cell v. This interference is caused by the 
transmissions that originate from pixels that do not belong 
in Vv. 


B. Elaboration on the formulation 


A first observation on the mathematical formulation of the 
previous subsection is that the minimisation of the 
aggregate allocated uplink power may be achieved if 
constraints (2) stand with equality. Therefore, they can also 
be rewritten as follows: 


Aes) ABH) 5 59 (6.3).nlg)- Aas 


qepix(v) FES 


1 (v) + NW 


V(v, p,s) € VV x pix(v)x S) (2a) 


In the set of equalities above, focusing on an arbitrary cell 
V, it may be observed that for all pairs ( P, s) ( pix(v) x s) 
the right sides are equal, which means that the following 
relations hold: 


Plp.s)-Alp.l)) _ p ) ,v(p,s) (patv)x5), v eV (6) 


An important remark that may be deduced from the relation 
above is that the calculation of the P,,(v) values is 


sufficient for the solution of the AATP-U problem. From 
the P.,(v) values the optimal powers P(p,s) may be 
computed through relations (6). This approach entails a 
significant decrease in the complexity of the solution of the 
AATP-U problem. The physical meaning of relations (6) is 
that for each cell v eV there is a target power level, P,(v)> 
at which the controlling BTS should receive the 
transmissions that originate from the cell. Hence, the 
transmission power of each connection in the cell should be 
adjusted so as to comply with the target reception level at 
the BTS site. This result is in accordance with those of early 
studies on the capacity of DS-CDMA systems, involving a 
single offered service. 

By appropriately combining relations (6) and (2a) it is 
obtained that the p(y) quantities may be calculated 
through the following system of equations (the detailed 
derivation is omitted for brevity). 


P,(v) = Len (v) + NW VveV (7) 


1- Dk, (a) 


gepix(v) 
The ,(g) quantity is defined as x (q) = Yn, (9,3): SIR(3)> 
FeS 
and may be seen as an expression of the load that 
corresponds to the uplink connections that originate from 
pixel q . Moreover, J, (v) can be expressed as: 
L,(v) = p(s). Yk eel Vv eV (4a) 
De |e. ps DAG a): A(q,i()) 
C. Algorithm 


This sub-section includes an algorithm for the AATP-U 
problem. The algorithm starts from the computation of the 


optimal P(v) values and then proceeds to the calculation 
of the P(p,s) values through relations (6). The algorithm 


relies on relations (7) and (4a), which are solved in an 
iterative manner as follows: 


Pis!(y) = Teu(v) + NOW Vv eV (7a) 
0) 
Ks g set vveV (4b) 
oo Fel): alla a): Aq, l() 


The starting point is PS(v) =0 for all y eV. Formally the 
algorithm can be described as follows: 

Algorithm for the AATP-U problem 

Step 0: Initialise the algorithm iteration counter, i, and 
the initial power values, ie., set 1=0, pi(y)=0 for all 
veV. 

Step 1: Compute for all vev the /' (y) quantities 
through formulas (4b). 


up ; 
SIR,(s) Step 2: Compute for all veV the Py ‘(v) quantities 
through formula (7a). 
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Step 3: Evaluate whether the termination criteria are 
satisfied. If the termination criteria are not satisfied set 


Pi(v)= pi(y) for each veV, increase the algorithm 


iteration counter, i.e., set i= i+1, and go to step 1. 

Step 4: Compute the optimal p(p,s) values by using the 
Pi (y) values and relation (6). 

Step 5: End. 

The termination criteria in step 3 depend on the evolution of 
the p‘(y) values. When a feasible solution exists, the above 


algorithm will converge to the optimal values P(v) = 
tim{ Pi, (v)) < o, for every veV. In the opposite case, the 


P.(yv) values will tend to infinity. For this reason, the 


algorithm can be terminated in one of the following cases. 
First, at step i in which the condition 


(Pir(v) - le (v))/ jg (v)< € (e<<!1) is satisfied for every 
veV. Second, in case the condition concerning the 
terminal power budget is violated, i.e., when joa (v) > 


min |P™(s)- (p,i(v)) /siR,(s)]- The first termination 


pepix(v);s 

criterion means that the AATP-U algorithm successfully 
accomplishes its task, by converging to a feasible solution. 
The second termination criterion means that the algorithm 
fails to find an acceptable solution. 


TV. CONCLUSION . 


This paper addressed a planning problem that is important 
to the dimensioning of W-CDMA-based cellular networks. 
The problem aimed at finding the optimal feasible 
allocation of transmission power to the sets of uplink 
connections that should be supported by the system, so as to 
cope with a corresponding traffic load scenario. The 
problem was concisely defined, mathematically formulated 
and solved by means of one computationally efficient, 
novel algorithm. 

The algorithm presented herewith is suitable for application 
in the design, or the re-engineering, phases of the system. In 
this perspective, they are seen as components of a suite of 
software tools that are targeted to the dimensioning of W- 
CDMA-based networks. The extension of our work will 
lead to algorithms that are suitable for use in the 
management domain. 
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Abstract 

An alternative approach, based on an iterative least- 
squares optimization technique, for the design of 
antenna arrays exhibiting equal side lobes in their 
radiation patterns is presented. For a given number 
of elements equispaced by a constant amount, the 
algorithm returns the excitation currents under the 
condition that all side lobes are at the same level. 
The results show excellent agreement with those 
reported using other approaches. 


Indexing Terms: Antenna arrays, Equi-side lobe 
radiation patterns. 


I. Introduction 

Tapering the excitation currents in antenna arrays is 
known to reduce the lobes level especially those 
adjacent to the main beam [1]. Suppressing the side 
lobes saves power in undesired directions on the 
expense of broadening the main beam, i.e., less- 
directive radiation. This trade-off is optimized by 
utilizing the well-known Chebychev polynomials 
[2]; the pioneering work of Dolph is usually cited in 
this regard [3]. A formulation, which makes no 
direct use of these polynomials, was given in [4]. 
The formulation was given in terms of matrices and 
one needs to invert a square matrix whose size is 
proportional to the number of elements in the array. 
The interelement optimum spacing given in [4] was 
derived elsewhere [5]. Other investigators (e.g. [6]) 
used neural network techniques in order to tailor the 
array factor. No indication to computational time 
was mentioned. This paper formulates the problem 
as an optimization one in the least-squares sense. For 
a given number of elements arbitrarily excited an 
iterative algorithm, based upon Fletcher-Powel 
technique [7], is used to produce the desired 
excitation currents necessary to realize a Chebychev 
array. 


II. Formulation 
A linear array consisting of N elements 
symmetrically distributed with constant interelement 


.edu.jo 


spacing is considered. The array factor of such 
geometry is given by [5] 

N/2 

l 
AF =2) I, cos((n-—)w), (1) 

2, , cos((n-—)y) 
for even number of elements. In the case of odd 
number of elements, the array factor could be written 
as [5] 


N-1 


AP =I Sy cos(ny) (2) 
n=1 
In Egs. (1) and (2), y=kd cos6+B , k=2n/A; d being 
the wavelength, d is the interelement spacing, 8 is 
the angle measured from the line of the array, /, is 
the magnitude of the current for the nth element on 
the either side of the array midpoint, and Jp denotes 
the current of the center element when N is odd. The 
position of the main beam could be placed on the 
broadside or on the endfire of the array axis or 
otherwise by properly selecting the value of B. Here, 
broadside arrays, i.e., B=0 will be considered, the 
extension to other cases is straightforward, however. 
The statement of the problem is that if we are given 
d and the initial setting of currents {7} }, which 
certainly produces unacceptable side lobe levels, 
how do we determine the current distribution { J, } 
to achieve our goal that all the side lobes are at the 
same prescribed height. 


III. Design Algorithm 

In this section, the design algorithm is presented for 

controlling the array factor characteristics. The 

current excitations {/,} are sought so as to equate 

all the side lobes appear in the array factor. The 

process of achieving this may be described in the 

following 

1. Specify the number of elements N and their 
interelement spacing d. 

2. Start with arbitrary excitation currents { J Y Me 

3. Calculate the array factor AF(/}) using either 


Eq.(1) or (2) in the range 0< O<7. 
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4. Examine the obtained response for local maxima 
in the undesired directions. These values 
Ar, °) and their locations 0, are saved in 
an array to be used in the next step, 
1=1,2,++-, Dix - Here, P,,», denotes the number 


of peaks appear in side-lobe band. 

5. The error function, e, defined as the summation 
of the square of the differences between the 
obtained peaks and the specified lobe level (5) is 


constructed 
Pmax 
2X CIROR Sy ME) 
i=l 
6. Minimize the error function as given in 
Eq.(3) by using the gradient ( V ) of e 
Oe 
Ve=— 4 
= (4) 


The evaluation of Eq.(4) demands finding the 
derivative of the array factor with respect to each 
{7,} at 8=0, . Thus a correction term by 


which the design currents are updated may be 
found. 

7. At this stage, the difference between the new 
solution and the previous one is checked. If it is 
smaller or equal to a specified tolerance «, the 
optimization process is terminated. Otherwise, a 
new iteration should be carried out. The process 
may also be terminated if e<e,; wheree, isa 
specified minimum error. There is a possibility 
to terminate the whole process if the maximum 
number of iterations has been reached. 

The above algorithm is summarized in the flow chart 

of Figure 1. 


IV. Illustrative Examples 

The above procedure is used to reproduce some of 
the results given in [4]. Table 1 shows that the results 
of the present procedure are in excellent agreement 
with that presented in [4] and _ using classical 
Chebychev polynomials approach [1,5]. In the table, 
the current excitations are normalized with respect to 
the edge current, which is unity, and the SLL denotes 
the side lobes level, given in decibels, with respect to 
the main beam. Needless to say that here neither 
inversion of matrices nor tedious comparison of like 
power terms is necessary. From computational point 
of view the alogorthim returns the desired currents 
with fairly reasonable time. The five elements 
example given in [8] with 4/2 as an interelement 
spacing is considered. The algorithm starts with a 
uniform amplitude excitation with an objective to 


equate all the lobes at -20dB level. The excitation 
currents as returned by the suggested algorithm are 
T,={1.00 1.6085 1.9319 
1.6085 1.00} 
The array factor of this array is shown in Figure 2. 
This is in agreement with [8]. As a last example we 
consider six elements half-wave spaced and require 
that their radiation pattern is clean of lobes (i.e. the 
lobes are at — 00 dB level). The currents are given by 
1,={1.00 5.0178 10.0551 
10.05515.0178 1.00} 
This case corresponds to a binomial array [9] and the 
currents may be calculated using the Pascal triangle. 
Both approaches are in excellent agreement. 


V. Conclusions 

An efficient fast algorithm to design Chebychev 
arrays has been presented. The algorithm avoids the 
direct use of Chebychev polynomials or sitting up a 
matrix through which the excitation currents are 
sought. This is become a time-consuming when great 
number of elements ' are used. The algorithm is 
verified by comparing the results with other 
investigators and found in excellent agreement. The 
limiting case of the absence of side lobes from the 
array factor is checked and the expected result of 
binomial array has been obtained. 
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Figure 2 Radiation pattern of 5-elements Chebychev array with -20dB side lobe level 
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the number clements (V) 
the current excitations { /° } 
the spacing d 


= the desired lobes level (5) 
® tolerance (s) 
Number of iterations (Jmax) 


Calculate the array factor AF( ih ) 


Find the local maxima of the side lobes AF, their locations 0, __, and the number 
of peaks p..,, 4=1,2,---, Pmax 


Find the error function e: 


e= S'(AF_, @)-8) 


Find the gradient Ve and update /,, 


P= (Uh) - dl, 


No 


No Ea lee 


and/or es<e, 


<a 


Yes 
Write /, 


Figure 1 Flow chart for obtaining Chebychev array 
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Mobile Communication in Road Tunnels: 
Influence of the Traffic Conditions on the 
Channel Characteristics 


Samuel Betrencourt, Martine Lienard, Pierre Degauque, Member, IEEE and Daniel Degardin 


Abstract--This paper presents a theoretical and experimental 
approach of the propagation characteristics in a road tunnel. 
The carrier frequency is 2.1 GHz and both a narrow band and a 
wide band analysis have been carried out. 


Index Terms--Tunnels, impulse response, 
coherence bandwidth, channel characteristics. 


delay spread, 


I. INTRODUCTION 


The determination of the channel characteristics when a 
mobile moves inside a road tunnel has often been studied 
theoretically, assuming a simple geometrical configuration. 
Indeed, for a rectangular and straight tunnel, the 
determination of the images of the fixed antenna placed inside 
the tunnel is straightforward and the image theory is then 
applied. A more sophisticated approach can be used by 
dividing the tunnel walls into multiple facets in order to better 
fit the actual shape of the tunnel [1]. Nevertheless, the 
convergence is rather tedious and becomes time consuming 
for long tunnels. Furthermoye, the introduction in such models 
of vehicles with random size and position seems a very hard 
task. It is the reason why extensive measurements have been 
carried out in various road tunnels and in different periods of 
the day in order to point out the change in the channel 
characteristics as a function of the vehicle density inside the 
tunnel. 

For all configurations, the transmitting antenna is placed 
on a post at the tunnel entrance and a frequency range of 2.1 
GHz is considered. First a narrow band analysis is described, 
by considering the field amplitude variation when the 
receiving vehicle moves in a lane at the same speed as the 
other vehicles. Many distributions have been proposed to 
describe the statistics of the short term signal amplitude when 
a mobile moves in an urban environment, Rayleigh, Rice, 
Weibull and Nakagami-m being the most popular [2]. 
Nevertheless, the received signal in the tunnel can be 
considered as the sum of two main contributions: The 
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multiple reflections on the tunnel walls but mainly with a 
grazing angle of incidence, and the reflections on the 
vehicles. For the first contribution, the rays playing a leading 
part come mainly from the direction of the tunnel axis and, in 
other words, the spread of the angle of arrival is small. 
However, the change in their phases gives rise to deep fadings 
when the mobile is in movement. For the second contribution, 
the other vehicles are more or less distributed around the 
receiver but important masking effect may also occur in case 
of the presence of trucks for example. It is thus important to 
determine the best procedure to characterize the fast fadings 
by studying the influence of parameters such as the length of 
the interval on which the statistics is calculated. 

The complex impulse response of the channel is also 
measured and the direction of arrival is deduced. 
Experimental results will be presented showing the influence 
of the traffic density on the delay spread. 


II. NARROW BAND ANALYSIS 


A. Field amplitude distribution 


The tunnel in which the experiments have been carried out 
is a straight one way, four lane tunnel, slightly arched and 
which has been simulated by a rectangular cross section, 16m 
wide and 7m high. The transmitting and receiving antennas 
are situated at a height of 2m and at a distance of 1m and 6m 
from the vertical wall, respectively. The theoretical model is 
based on the image theory, the contribution of each image 
being weighted by the reflection coefficient on the walls. 

Figure 1 represents the field amplitude variation which has 
been calculated from the propagation model. First, it clearly 
appears (curve b) that the contribution of the direct is 
negligible and that the mean variation of the total field (curve 
a) can be fitted by a double logarithmic regression. For this 
configuration, the break point is situated at a distance of about 
300m from the transmitter. One can thus determine the 
distribution of the field around the two regression lines, and it 
appears that both the theoretical and experimental values 
follow a Rayleigh distribution. It must be emphasized that the 
experiments took place either during the night, when the 
tunnel was nearly empty, or during the rush hours. Such a 
distribution was already obtained by making a narrow band 
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analysis of the field propagation in large railway tunnels, as 
discussed in a previous paper [3]. 


tunnel 8°16 f=2.1GHz 


puissance recue 


n 
° 500 1000 1500 
distance TX RX 


Fig. 1: Field amplitude variation between 30m and 1500m. 
Curve (a) in Figure 2 represents the field variation 


measured every lcm in a nearly empty tunnel, while curve (2) 
has been obtained in case of a heavy traffic. 


tunnel trafic/vide f=2.1GHz 


puissance recue dBm 


0 100 200 


300 400 500 600 
distance TX RX m 


Fig. 2. Measured electric field in a real tunnel. -a-: Low 
traffic; -b- Dense traffic. 


For a better visualization of the Figure, curve -b- has been 
shifted of 30 dB. A statistical analysis on many stored files 
shows that a dense traffic gives rise to an additional 
attenuation of the received power. As a result, for a distance 
greater than 300m, there is nearly no change in the slope of 
the path loss and thus practically, a one slope model fits the 
experimental values. 


10“ Mediterranean Electrotechnical Conference, MEleCon 2000, Vol. I 


B. Comparison with a Rayleigh distribution 


As we have previously mentioned, a Rayleigh distribution 
is well suited to describe the field variation along the total 
distance under consideration. Nevertheless, it can be 
interesting to introduce a quantity which can express this 
agreement in a more quantitative way,. To achieve this 
objective, one can start from the probability density function 
p(t) corresponding to a Rayleigh distribution. It takes the 
form 


p(t) = exp(-1” /20”) (1) 
oO 


The mean m(r) and the standard deviation o, of the signal 
envelope r(t) are related to o by the following expressions: 


m(r)= V¥1/20 (2) 


and 
6, =V2-n/20 (3) 


In order to generate curves of the probability density 
function or of the cumulative function, the parameter o has 
been calculated so that the mean values m(r) deduced either 
from the Rayleigh distribution or from the actual distribution 
of the field are identical and normalized to 1V/m. In that case, 
(2) leads to o = 0.8. Calculated in that way, the parameter o 
will be noted 6, . 

On the other hand, o can also be deduced from the value of 
the standard deviation of the signal envelope; that value of o 
deduced from (3) will be identified by o,. The relative 
discrepancy between o, and 6, will provide quantitative 
information on the discrepancy between the calculated 
distribution and a Rayleigh distribution. 

The statistical study has been made by calculating the field 
variation around the regression line for a distance between 
300m and 600m from the transmitter, but by increasing 
continuously the interval D on which the statistics is 
calculated. 

Curves in Figure 3 represent the variation of the ratio 0,/Om 
for different traffic conditions. 

Curves (a) and (b) are two examples of results which have 
been obtained in a nearly empty tunnel. One can note that this 
ratio tends to unity for an interval D greater than 100m. It 
means that if the statistical fluctuation of the field around the 
regression line is calculated on an interval equal or greater 
than 100m, the field amplitude follows a Rayleigh 
distribution. Curve (c) is a typical example of the variation of 
this ratio when the traffic in the tunnel is heavy. By 
comparing this curve to the previous ones, one remarks that it 
tends to 1 more rapidly. This is of course due to the fact that 
the various vehicles give rise to additional and random 
reflections. Nevertheless, it may appear in few cases that the 
agreement with a Rayleigh distribution is worst in presence of 
heavy traffic, compared to the case of a nearly empty tunnel. 
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ig. 3. Varmon of the rato ,/o,, versus the length of the 
mterval D on which the statistics has been determined. 


An example is pomied om by cave @) It is difficuk to 
know precisely the caviromment of the mobile receiving 
amcnma because ts cuve SS extracted from numerous 
records. Ome can mmagime thet such a behavior was obtained 
im presence of large webacles or tracks, masking most of the 
rays and thus docreasing appreciably the spread of the angles 
of sanival of the rays. 
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A Fregnency domam 

The channel characteristics were first calculated from the 
Propagation model based om the ray theory. The coherence 
bandwidth can be deduced either from the complex transfer 
faction of the channel or ffom the envelope correlation. In 
this paper, we wall only comsider this last way of calculation. 

‘The transfer function has been calculated versus frequency 
aad Si many receiving points. Curve im Figure 4 represents the 
variamon of this coherence bendwadth versus the distance 
between the transmitter and the receiver, varying from 30m to 
300m. 

ft first appears that the cohereace bandwidth is strongly 
Gependent on the position of the receiving antenna, since it 
may vary betweca 7 MHz and 40 MHz. This is 2 typical 
festare of the propazation im tunnels and it is easiest to 
explam tus resuk im time domain. 

Let ws asseme that the rays arriving with a grazing angie of 
mmcadeace On the tunnel walls, combine im phase im a specific 
ae wery small, one can expect that the delay spread will be 
small, leading to 2 large value of the coherent bandwidth. On 
the comirary, it may happen thet the rays propagating nearly 
along the tunnel axis are opposite im phase. In this case. their 
commybutions will have the same order of magnitude as those 
Gae t high onder reflecting rays, giving rise to an increase of 
the Gelay spread and 2 small coherence bandwidth. 
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Fig. 4: Variation of the coherence bandwidth versus the 

position of the receiver. 


It must also be noted im this Figure that the mean value of 
the cohereace bandwidth does not depend on the position of 
the receiver and, for this tunnel, is equal m 12 MHz A 
parameine Stady shows that in a smaller two lane tunnel, the 
expected bandwidth is on the order of 30 MHz. 


B. Time domain 


A channel sounder was used to stady the channel behavior 
im Gme domam. ft is based on a comelation between a 
Transmitted pseudo random sequence and the received one. 
The bandwidth of the system is 42 MHz, kading to a 
resolution in me domain of 24 ns. The transmitting antenna 
was 2 half wave dipole while the receiving one was a quarter 
waveleagth antenna put on the roof of the car moving im the 
tunnel. Thousands of records were made im different traffic 
conditions. In most cases, the impulse response has the shape 
indicated im Figure 5. 
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It appears that the main peak has an amplitude 15 dB 
higher that the others and one can thus consider that the only 
effect of the multipath propagation is to slightly increase the 
width of the main pulse. Furthermore, the resolution of the 
channel sounder was not fine enough to separate the 
contributions of the different rays inside this pulse. Indeed, a 
parametric study shows that, in such a tunnel, the delay spread 
is on the order of 15 ns and cannot be visualized with this 
channel sounder. Nevertheless, on about 5% of the records in 
presence of heavy traffic, it is possible to put in evidence 
reflections on trucks or vans. Examples are given in Figures 
6a and 6b. 


amplitude en dE 


temps ens x 10° 


bine 


Fig. 6: Examples of impulse responses showing reflections 
on various vehicles inside the tunnel. 


The delay spread has been calculated from these records 
and if an attenuation of 15 dB is considered, the statistical 
analysis show that 40% of the impulse responses present a 
delay spread smaller than 50 ns, 40% between 50 ns end 250 
ns and lastly, 20% between 250 ns and 500 ns. 


IV. CONCLUSION 


Experiments have been carried out in a four lane tunnel 
about 16m wide and 7m high. If the tunnel is nearly empty, 
the path loss between 30m and 600m from the transmitter can 
be approximate by a two slope model. It appears that in this 
configuration, the break point is situated at a distance of about 
300m. For tunnels of smaller cross section, one can expect to 
get the break point at a smaller distance since the high order 
modes excited in the tunnel are more strongly attenuated. 

Near the transmitter, the attenuation per unit length is 
5 dB/100m and becomes smaller than 1 dB/100m in the far 
zone. In the presence of a heavy traffic, the attenuation 
increases and a one slope model fits better the experimental 
data. In this case, the slope is on the order of 2 dB/100m to 
3 dB/100m. 

The wideband analysis shows that in an empty tunnel, the 
coherence bandwidth is equal to about 13 MHz while the 
average delay spread is less than 20 ns. In presence of a heavy 
traffic, only 5% of the records exhibit clear additional 
reflections. Few examples of delay spread have been 
presented. Other experiments will take place to point out the 
effect of the tunnel geometry. 
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Uninterrupted Data Transmission over GSM. 
and Application to Health Services 


C.F.Grecas, I. S. Venieris’, 


Abstract- This paper seeks to evaluate the sufficiency of the 
GSM radio functionality in suppo'rting the case of patient’s 
data transmission from a moving ambulance. Employing the 
TCP/IP and the UDP/IP protocol duos, repeated 
transmissions of a file are carried out. The reliability of the 
data coding used by the GSM is examined whilst the stability 
of the connections realized in the GSM air environment is 
investigated. Finally, suggestions are made and solutions are 
proposed in order disconnections, due to radio link blackouts, 
to be avoided. 

Index Terms: GSM link blackout; GSM link stability-- 


INTRODUCTION 


Data transmission over GSM [1, 2] is currently served by a 
number of bitrates from 300 bps up to a Full-rate Traffic 
Channel of 9.6 Kbps (TCH/F9.6) in a singleslot circuit- 
switched mode. New multislot bandwidth widening 
technologies, based on the current GSM architecture, such 
as High Speed Circuit Switched Data (HSCSD), with 
maximum bitrate of 76.8Kbps, and General Packet Radio 
Services (GPRS), with maximum bitrate of 100 Kbps, are 
going to be introduced shortly. [3, 4, 5]. 

Anyway, the link stability over the radio interface in the 
current GSM singleslot transmission or in the tomorrow’s 
multislot allocations is of extreme importance. 

Having in mind patient’s data transmission from a 
traveling ambulance, we try in the present article to 
evaluate both the stability of the connection made through 
the GSM mobile environment as well as the reliability of 
the coding scheme used by it. Moreover, a hierarchy of 
options, aiming at customizing the performance of the 
radio link, is proposed. 

The present paper is organized as follows: The 
experimental part is presented in Section II. In Section III 
solutions are proposed to the disconnection problem 
observed during the measurements, due to the low field on 
the GSM link. Finally, the service hierarchy is suggested. 
Conclusions appear in Section IV. 


I. Experiments 
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A. Test Bed 

The GSM through specially designed coding is able tc 
identify and correct, in most cases, data corrupted in the 
RF channel. In order us to control and evaluate the stability 
of a GSM connection as well as the reliability of the coding 
in retrieving corrupted data, we have built two cases ot 
client-server: a TCP/IP based and a UDP/IP based link, ir 
a 


Windows 98 environment. The WinSock API and Visual 
C++ 6 methods have been used. In both cases, the serve! 
was installed on a static Pentium MMX computer running 
at 166 MHz, with 32 MB RAM; while the client was 
running on a Pentium MM«X laptop at 200 MHz, witk 
32MB RAM. 

The server was connected to the PSTN through an externa! 
modem, whereas a GSM PCMCIA card at 9600 bps wa: 
the interface of the laptop to the mobile transmissior 
environment. 

Two sets of ten measurements each carried out in the same 
environment selected fortuitously. For the measurements 0! 
a set, a car moving always on the same path was carrying 
the client’s laptop, from which the transmission of a 30( 
KB bitmap file was triggered immediately after the cai 
started moving from the beginning towards the end of the 
path. A counter on the corresponding server applicatior 
recorded the time required for the receipt of the entire file. 
During the transmission the strength of the radio link field. 
given by the GSM PCMCIA card on the laptop’s screen. 
was observed. In all the disconnection cases the fielc 
strength reached its lower limit. Every time 4 
disconnection occurred at some point of the way, a new 
connection was established after the car’s return at its 
starting point. 


B. Measurements 

The measurements (Tables 1, 2) took place in twe 
successive evenings, in the Campus of the National 
Technical University of Athens at Zographou area. 


Table 1. 
TCPAP case (connection at 9600 bps, velocity 35 km/h, 
filelength to be sent 300 KB) 
Transmission Receiving Disconnection 
started athour Duration 

(min) 


KB received 
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19.20 4.758 no 300 
19.40 11.738 no 300 
RS) - yes 214 
20.10 - yes 205 
20.20 - yes 239 
20.35 - yes 187 
20.50 - yes 161 
21.45 - yes 197 
22.05 8.384 no 300 
22.20 - yes 210 
Table 2. 


UDP/IP case (connection at 9600 bps, velocity 35 km/h, 
filelength to be sent 300 KB) 


Transmission Receiving Disconnection KB received 
started athour Duration 
(min) 
19.45 5.446 no 300 
20.00 - yes 200 
20.10 5.521 no 300 
20.25 - yes 218 
20.40 5.464 no 300 
21.00 5.590 no 300 
Zilel'> - yes DD 
Zlk25 - yes 236 
225 - yes PAP 
22°35 - yes 208 


Note: Testing transmissions from the car been parked 
(motionless transmissions) for both the TCP/IP and 
UDP/IP cases carried out successfully. 


As regards both sets of measurements all the registered 
disconnections took place within the same 100 m portion 
of the path. 


C. Remarks 

The conclusions extracted from the measurement results of 
the Tables above are the following two: 

First, the convolutional coding used by the system at 
physical layer level for the TCH/F data channel seems to 
have worked adequately under the conditions the 
measurements took place. The integrity of the information 
is ensured, even in the cases in which the data were 
transmitted by means of a connectionless method, such as 
the UDP/IP protocol. Actually, data of periodic nature like 
for example the outputs of electrocardiographs or 
manometers, carried by an ambulance, could be 
transmitted over UDP/IP, which being “lighter” is faster 
than the TCP/IP. Because of the periodicity of this kind of 
data it is possible, up to a certain point, unrecoverable 
losses expected during connectionless transmission to be 
traded-off. 

Second, in the environment the present measurements 
carried out, the system seems to suffer of connection 
instability, because of inadequate radio coverage; to a great 
percentage the transmissions, in both TCP/IP and UDP/IP 
cases, did not carry through. But if this is the situation the 
remedy should apparently be the improvement of the field 
conditions. The questions then arise: Is it, practically, 


possible these conditions, actually related to landscape, 
buildings and structures in general, traffic, foliage etc. as 
well as combinations of them, to be checked and, if 
necessary, to be improved, in any micro-area of the mobile 
environment? Should the system introduce a transmission 
securing hierarchy, according to which the user can be 
assured, as much as possible, of the stability of his/her 
connection following the importance of the “payload” and 
the tariff paid for it? How the system should then allocate 
its resources? 


III. Proposed Solution 

The fact that the GSM specifications foresee the case of 
disconnections due to bad field conditions, providing an 
intrinsic mechanism to face the situation, in combination 
with the disconnections observed in our measurements led 
us to make the herein proposals, concerning the 
enhancement of the GSM’s disconnetion handling 
procedures. 


A. Current GSM Mechanisms 

Before going any further, to be reminded anyway the 
followings, according to the situation as it currently stands. 
i) In the GSM the signaling regarding the routine 
maintenance of a radio link is carried out through 
conversations between the MS (Mobile Station) and the 
BTS (Base Transceiver Station) over the Slow Associated 
Control Channel (SACCH). 

ii) Every time a call is normally terminated or it is 
interrupted, because of bad link conditions, the session 
release process is activated. This mechanism pertains to 
the Radio Resource Management protocol, which is a 
sublayer of layer 3 signaling protocol, as it has been 
adapted to deal with the radio environment demands. Its 
transactions are extended through the BTS and the Base 
Station Controller 

(BSC) up to the Mobile subscriber Switching Center 
(MSC). 

Actually, both the MS and the BTS monitor the loss of 
SACCH frames increasing a corresponding counter for 
every SACCH frame received and decoded, while 
decreasing it for every frame expected but not decoded. 
The value these counters are initialized is a characteristic 
of each cell and is referred in the specification [6] as the 
RADIO_LINK TIMEOUT parameter, with maximum 
value between 4 and 64. The RADIO LINK TIMEOUT is 
transmitted by the cell’s BTS on the Base Control Channel 
(BCCH) or passed with a SACCH message to the MS, 
when it enters a new cell after a handover. 
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Link's Quality Evaluation 
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Fig. 1 — Through the counting of the SACCH messages the 
quality of the link is evaluated. 


Whenever either of the counters becomes zero (Fig. 1) the 
link is considered broken and the release mechanism is 
activated. The MS returns to its idle mode, while the BSC 
returns the radio channel to the pool of the free physical 
resources (frequency — time slot pairs). Redialing and the 
consequent call re-establishment must be re-attempted in 
order the connection to be restored. 

B. GSM Improvements 

1. The customization of the RADIO LINK TIMEOUT 
parameter 

As already mentioned above, our measurements have 
experienced link disconnection, because of poor field 
conditions, within an interval of 100 meters. It takes about 
10.3 sec to go through these 100 meters for a car moving 
with 35 km/h. 

Taking into account that SACCH frames are sent down 
and uplink at a rate of 2 per second, our link in order to 
survive would be able to sustain, at the endpoint of the poor 
signal receipt, a maximum sequence of about 20 SACCH 
undecoded frames. On the basis of the above observations 
is made clear that, if the time interval in which these lost 
SACCH frames are tolerated (or equally the values at 
which the counters are initialized) was a subscriber’s 
option, the disconnection could be avoided. 

At this aim, the suggestion of the operator, who in any case 
is the most aware of network sensitive points of the kind 
we examine here, and/or any personal experience of the 
user, based on observations like the one we have just used, 
should be combined. The subscriber, finally, could decide 
what serves best his/hers necessities and specify the value 
of the RADIO LINK TIMEOUT parameter, under which 
his/hers data session should be treated by the network. 

The hassle of disconnection, the consequent redial for 
reconnection and the uncertainty of what data has been 
sent before disconnection and remains to be sent after it 
should take into account for a decision like this. 

2. The “hold transmission” mechanism for an 
unacknowledged transmission mode 

At this point the question of what happens with the data 
traffic arises. To be mentioned anyway that the SACCH is 
an in band signaling logical channel, so the loss of SACCH 
frames is an indication that user data has been lost too. 


Considering again the crucial time of 10.3 sec, the 
maximum amount of user data that could be lost is 
9600*10.3 = 98.9 Kbits or about 12.3 Kbytes. This could 
be a value considerably high, especially when an 
unacknowledged transmission mode is used, as is the case 
of the UDP/IP. In order to avoid such a loss, a process that 
holds the transmission of user data over the bad link could 
be introduced. It can be implemented using the counters of 
the SACCH frames. For these two counters, a threshold of 
expected but not decoded SACCH blocks, allowed 
successively uplink or downlink, should be defined. 
Whenever this threshold is passed at one of the endpoints 
of the link (the MS or the BTS), a mechanism, that holds 
at this endpoint the transmission of bursts carrying user 
data, is activated. Only the transmission of the SACCH 
carrying bursts is permitted (see Fig. 2), necessary for 
link’s quality evaluation. So, a possible handover decision 
is not barred, whilst the activation of the session release 
procedure as modified and given below is not excluded 

In the case of a bidirectional transmission, special 
signaling should be used. So, with the very first, properly 
coded, SACCH message from the endpoint that held the 
transmission, the other end is signaled to stop transmitting 
as well. If the receiving conditions at the latter end permit 
the receipt of such a signal the transmission is held there as 
well. Otherwise (signal not received because of bad link 
conditions) the “hold transmission” will be activated on 
this end too, when the corresponding threshold will be 
passed. 

As an example, if on one of the ends the threshold of two 
faulty SACCH frames is put, then the maximum amount of 
erroneous user data received before the hold transmission 
mechanism is triggered might be 9600*0.5*2 = 9600 bits 
or 1200 Bytes. Additional 9600bps * 0.5 = 4800 faulty bits 
might be received before the emission suspension on the 
other end, that is efore the arrival o the first SACCH 
message from the already muted end. Apparently this value 
is modified according to the threshold used. 


26 frames = 120 ms 


11 127 SS 23: 24025 
i) Traffic Channel Multiframe Structure (TCH/F) under 
Inormal transmission conditions 


no transmission ls | no transmission 


ii) When the transmission is held only SACCH are transmitted over 
he link 


= user data carrying burst, s = SACCH burst, i = idle (no data) 


Fig. 2 — When the “hold” user data transmission mechanism is 
activated only the SACCH data are transmitted. It takes four 
SACCH bursts for a SACCH message to be sent, or 480 ms per 
SACCH message. 
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On the other hand the transmission must be restarted when 
the conditions on the link are acceptable. At this aim 
another threshold must be introduced, which will define 
the number of the SACCH frames that must be successively 
received and decoded before the transmission is repeated 
(see Fig. 3). At the other end, in the case of a bi-directional 
transmission, the reactivation of the link and the receipt of 
the first traffic data trigger the restart transmission 
mechanism. 

Obviously, the values the thresholds under consideration 
must be given should be also a matter of negotiation 
between the service provider and the mobile subscriber. 


Link's Quality Evaluation 


; 
Hold transmission 


5 

4 

3 Restart transmis. 
A 

1 

0 


T Tr i T T =, 


OOS 1° 1:5) (2° 2.9) 13) 3!9' 4" 415) 5 5:5: 6) Gis 


time (sec) 


Fig. 3 — Hold and restart transmission thresholds (vertical lines 
at the indicative values of 1.5 and 3.5 sec). During the time 
interval between these two thresholds only the SACCH is 
transmitted 


3. The “keep up the connection active” mechanism 
However, a case still remains to be studied: What about if 


the expected improvement of the link conditions never 
comes, and despite the possible activation of the “hold 
transmission” mechanism, the SACCH message counter 
becomes zero? According to the situation as it stands, as 
we have already mentioned, this is the death of the 
connection, which is what we try to avoid by all means. At 
this aim the introduction of a time interval, which should 
be user defined, is proposed here. During this interval, and 
although either of the counters has reached zero value, the 
connection and the allocated resources are maintained, 
expecting the appearance of decodable SACCH messages 
(see Fig. 4). 
Link's Quality Evaluation 


Hold transmission 


T-maintain 


@ 
-.) 
a 
a 
o 
E 
=x 
Oo 
3 


T 


ORO St elo e220 on d.0 44°90) 515) 16. 6:5 


time (sec) 


Fig. 4 - Although the SACCH counter has become zero, the 
connection is kept up active for T-maintain seconds. The 
appearance, during this interval, of a predefined number of 
decodable SACCH messages (dashed lines) permits the revival of 


the connection; otherwise the session is definitely released. The 
procedure can be employed in combination with the other ones 
presented here. In this figure it is used with the “hold 
transmission” mechanism. 


However, the expiry of this time causes the definite 
disconnection, if in the meantime the annulled counter has 
not attained a value that could permit the reliable data 
retransmission. Such a value could be the proposed, in the 
above case 2, retransmission threshold, if the “hold 
transmission” is employed. 

So, if it is the MS’s counter the one that becomes zero, 
instead of starting the normal procedure to return to it’s 
idle state, the MS remains active for a user specified time 
interval expecting a link’s improvement. In the case, on 
the other hand, it is the BTS the part that has the annulled 
counter the traffic channel granted for the communication 
is maintained, keeping the connection alive for the same 
time. 

Obviously, the employment of the “keep up” time interval 
is independent of the “hold transmission” mechanism. 


C. The impacts on the upper layers 

Although the “hold transmission” mechanism can be 
proved a very helpful tool acting as a data saver in the 
cases of unacknowledged modes of communication, its 
compatibility with the upper layers, when acknowledged 
mode layers are employed, must be assured. This is the 
case of the TCP transmission in the present paper. 


— Of the four timers employed in a TCP connection [7], 


namely the retransmission , the persistence, the keepalive, 
and the 2MSL timer, the retransmission(maximum time 
limit 9 min) and the keepalive timer (maximum time limit 
12.5 min) deal with the availability of the physical 
medium. 

In fact it seems rather impossible such long times, of the 
order of minutes, to be used in our case. 


E. A Service Hierarchy 

A service hierarchy dealing with the disconnection over the 
radio interface could be introduced. According to it the 
user can select between the following options. 

1. The basic one, whose parameters are absolutely service 
provider defined. 

2. The option which permits the user to select the value of 
the RADIO LINK TIMEOUT parameter time, that is the 
time before disconnection because of bad link quality. 

3. The employment of the “hold transmission” procedure 
and the negotiation of the “hold” and “restart” 
transmission thresholds. 

4. The use of “keep up the connection active” option. 

It is obvious that the options 2, 3 and 4 can be used 
independently or in combination to each other. 


IV. Conclusions 

In the present paper the ability of the GSM to support data 
exchange between a traveling vehicle and a fixed station 
has been examined. During the measurements the 
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sufficiency of the physical layer coding scheme, the system 
uses, to carry out transmissions even over a connectionless 
protocol, like the UDP/IP, has been noticed. On the other 
hand the frequent disconnections due to radio link 
blackouts and the inevitable redialing for a new connection 
led to the suggestion of methods that could relieve this 
problem. On this goal the SACCH message counters, 
currently used for the activation of the session release 
mechanism in the case of bad link, are employed. 
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